arXiv: 1507.07139v2 [stat.AP] 31 Mar 2016 


Nonparametric volatility estimation in scalar diffusions: 
Optimality across observation frequencies. 

Jakub Chorowski* 


Institute of Mathematics 
Humboldt-Universitat zu Berlin 
chorowsj@math.hu-berlin.de 


Abstract 

The nonparametric volatility estimation problem of a scalar diffusion process observed at 
equidistant time points is addressed. Using the spectral representation of the volatility in 
terms of the invariant density and an eigenpair of the infinitesimal generator the first known 
estimator that attains the minimax optimal convergence rates for both high and low-frequency 
observations is constructed. The proofs are based on a posteriori error bounds for generalized 
eigenvalue problems as well as the path properties of scalar diffusions and stochastic analysis. 

The finite sample performance is illustrated by a numerical example. 
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1 Introduction 

Consider the problem of estimating the volatility of a diffusion process (X t , t > 0). The statis¬ 
tical properties depend, essentially, on the observation scheme. It is natural to assume discrete 
observations: 

X 0 ,X A ,...,X NA , A > 0, T = IVA. 

The quality of an estimator is typically assessed by its asymptotic properties when the sample 
size N tends to infinity. The usual assumptions are either A —> 0 or T —> oo, which corresponds 
to high and low-frequency regimes, respectively. Different frequency assumptions require very 
different methods. Since the frequency regimes are a theoretical construct, for any given sample, 
we need to choose among high and low-frequency estimators. Therefore, it is of crucial interest to 
develop universal methods that will perform at optimal level regardless of the sampling frequency. 
In this paper, the first nonparametric estimator of the volatility that attains minimax optimal rates 
in both high and low-frequency regimes is introduced. In the parametric setting, the problem of 
the universal scale estimation was first raised in Jacobsen fl5 . 0- The constructed estimators 
were consistent and asymptotically Gaussian for all values of A, but nearly efficient for small 
values of A only. The estimation method, which relied on the use of the estimating functions, is 
different from the one applied in this paper. 
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It is a well-known consequence of the Girsanov theorem that when T is fixed, the drift coefficient 
is not identifiable. Since we are interested in a universal scale method, we focus on the volatility 
estimation and, henceforth, treat drift as a nuisance parameter. 

The existing high-frequency estimators (see Florens-Zmirou Hoffmann (li|, Jacod fl7], 
Bandi and Phillips i) are based on the interpretation of the squared volatility as the instan¬ 
taneous conditional variance of the process. Consequently, the assumption A —> 0 is crucial for 
the consistency of these estimators, see (§} and [27| Section 3]. On the other hand, it has been 
conjectured that the minimax optimal low-frequency estimator introduced by Gobet, Hoffmann 
and Reifi (GHR) 0 also performs well in the high-frequency regime. This conjecture is based on 
the observation that the spectral representation of the volatility in terms of an eigenpair of the 
infinitesimal generator can be generalized by replacing the invariant density with the occupation 
density of the path (X t ,t < T). While this generalization might be sufficient to obtain the consis¬ 
tency of the GHR estimator when applied to the high-frequency data, the numerical study reveals 
that the convergence rates are not optimal. The reason for this is that when the time horizon of 
the sample is fixed, the estimator inherits the poor regularity of the occupation density, which, 
contrary to the invariant density, is not linked to the regularity of the diffusion coefficients. As we 
show below, this difficulty can be solved with the appropriate averaging of the spectral estimator, 
which is the main motivation behind the Definition [G] of the universally optimal estimator. For 
more details, refer to section O 

Based on the spectral method, the low-frequency analysis of the universally optimal estimator 
is similar to ll|, [5j. The real difficulty is in the high-frequency analysis, where the universal 


estimator is compared to the benchmark high-frequency estimator introduced by Florens-Zmirou 
[9] (see Section f2~?l) . In particular, we develop the perturbation theory for bilinear coercive forms 
with Holder regular coefficients (see Appendix [Bj , which may be of independent interest. 

In the next Sections, we present the construction of the universal scale estimator and state the 
high and low-frequency convergence rates. In Section [2] we discuss the relation of the proposed 
estimator to the high and low-frequency benchmark estimators. Finite sample behaviour of the 
new estimator compared with the Florens-Zmirou and GHR estimators is illustrated in Section 
12.31 In Section l2~H we discuss the assumptions and possible extensions of the model. The proofs 
of the high and low-frequency convergence rates are shown in Sections [3] and 01 respectively. 


1.1 Construction of the estimator 

We follow the low-frequency literature (HUH and consider a diffusion model on [0,1] with 
boundary reflection (see Sectionfor a discussion of the model). Let || • ||oo denote the supremum 
norm on space D([0,1]) of bounded measurable functions on [0,1]. Finally, denote by 

H l = j/ £ L 2 ([0,1]) : / has i weak derivatives with /h) £ L 2 ([0,1]), j < < j 

the L 2 —Sobolev spaces on [0,1] of order * = 1,2. H l is a Hilbert space with the norm 

ii/ii^ = £ii/ (i) iu- 

j<i 


Assumption 1. For given constants 0 < d < D suppose ( a , b) £ 0, where 

0 := 0(d, D) = {(a, b) £ ^([0,1]) x D([0,1]) : \\b\\ x V ||a 2 || ff i < D, inf a 2 (x) > d}. 

:r(. 0.1 

Let the process (X t ,t > 0) be given by the following Skorokhod type stochastic differential 
equation: 

dX t = b(X t )dt + <j(X t )dWt + dK t , (1) 

X t S [0,1] for every t > 0, 
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where (Wt, t > 0) is a standard Brownian motion and ( K t , t > 0) is an adapted continuous process 
with finite variation, starting from 0, such that for every i>0we have f n l(o,i){X s )dK s = 0. The 
Sobolev regularity of a ensures that the SDE 0 has a unique strong solution, see (28l . Theorem 
4]. As shown in [ll|, X admits an invariant measure with Lebesgue density. 

Assumption 2. The initial condition Xq is distributed with respect to the invariant measure p on 
[0,1], independently of the driving Brownian motion W. 

Under Assumption [5J the diffusion X is stationary and ergodic. We denote with P CTj b the law 
of X on the canonical space U of continuous functions over the positive axis with values in [0,1], 
equipped with the topology of the uniform convergence on compact sets and endowed with its 
a —field J-. We denote with E^.j, the corresponding expectation operator. 

Definition 3. Denote by pn the empirical measure associated to the observed sample: 


1 1 *-i 1 

= 2jv«{x„l + ^2 ^ a »a} + 5 

n— 1 


2 N 


{- 1 C»a)' 


The underweighting of the first and the last observations is asymptotically negligible, but has 
meaningful finite sample interpretation both in the low and high-frequency regimes (see remarks 
before the equation © and after Definition m By ergodicity, when the time horizon T of 
the observed sample grows to infinity, the empirical measure p^^dx) converges weakly to the 
stationary distribution p{dx). When T is fixed, but the observation frequency increases, the 
empirical measure tends to the occupation measure pr of the path (X t , 0 < t < T) (see Definition 
©. 

Definition 4. For J £ N+, j = 1, ..., J, let 1 j(x) = 1(^- < x < j) be the indicator function of 
the j th sub-interval and 


i’jix) = / ij{y) d yi for 3 = 

Jo 

if 0 {x) = 1 . 

Let Vj = span{i/»j : j = 0,..., J} be the space of linear splines with knots at {0, j, j, ..., 1} 

and Uj 1 = {v £ Vj : f Q v(x)pjy(dx ) = 0 } be the subspace of functions L 2 (pn)— orthogonal to 
constants. 

Consider the generalized symmetric eigenproblem: 

Eigenproblem 5. Find ( 7 , u) £ M x Vj with 0, such that 

l(u, v) = 7 g(u, v), for all v £ Vj, 

where g, l : Vj x Vj —>• M are symmetric, bilinear forms defined by: 

g(u,v) = / u(x)v(x)p N (dx), 

Jo 

1 N ~ 1 

l(u,v) = 'y ^ — u(A„a)) ( ,y (^"(n-(-l)A) — ^(A^a)) ■ 


n—0 


When the observed sample visits at least twice every interval 7 ), the form g is positive 
definite on Vj, while l is positive semi-definite on Vj and positive definite on Vj. In such a 
case, Eigenproblem [5] has dim(Vj) = J+ 1 solutions with non-negative eigenvalues 

0 < 70 < 71 < ... < 7 j and g —orthogonal eigenfunctions. It is easy to check that 70 = 0 is an 
eigenvalue which corresponds to the constant function. Since the eigenfunctions are g— orthogonal, 
it follows that Uj £ Vj for 1 < j < ,/. Consequently, 71 > 0. 
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Definition 6. Let 


Ci = l0g ^ < 1 ) and u\{x) = ^2u ltj ipj(x). 

i =o 

When uij ^0 we define the spectral estimator by 

— 2 Ci /p 1 ipj(x)u 1 (x)'p, N (dx) 
fo ip'(x)uijjiN(dx) 

3 =i 



The condition 1 (A 7 i < 1) is a technical assumption which ensures that the estimator Ci is well 
defined. As explained in Section l!Q1 1 — A 71 is the estimator of the largest nontrivial eigenvalue of 
the transition operator. When A 71 > 1, the estimated transition operator is negative definite on 
Vj, thus the spectral approach will not provide a reliable output. Proposition 1201 and inequality 
m ensure that A 71 < 1 with high probability, both in high and low-frequency regimes. 


1.2 High-frequency convergence rate 

The estimation of volatility at point x is possible only when the process spends enough time around 

x. 


Definition 7. Set T > 0. Define the occupation density 

L j 1 

t * r = TO*' 


where Lt is the semimartingale local time of the path (X t : 0 < t <T). 

For any bounded Borel measurable function /, the following occupation formula holds: 


( 2 ) 


1 

T 


r 1 

f(X s )ds = / f(x)p, T (x)dx. 
Jo 


(3) 


In order to obtain the global rates of convergence, we must assume that the occupation density 
of the observed path is bounded from below. Therefore, for a given level v, we study the risk of 
the estimator conditioned to the event 


C v = < inf ut(x) > v > . 
l®e[o,i] J 

Theorem 8. Grant Assumptions [7] and [H Fix T > 0, 0 < a < b < 1 and v > 0. Choose 
J ~ A -1 / 3 . For every e £ (0,1) and A > 0 sufficiently small, there exists an event TZ e , of 
probability larger than 1 — e, and a positive constant C e , such that 

sup E a . b [l TCe n£„ • ||Ss - CT 2 ||Li([a,fc])] < C e At 
( a,b)ee(d,D ) 

Hoffmann [3 Proposition 2] shows that the rate A 1 / 3 is optimal in the minimax sense even 
in the class of diffusions with Lipschitz volatility. To prove Theorem [8] we compare with 
the benchmark Florens-Zmirou estimator, see Section E2I While the consistency of the spectral 
estimator can be obtained using the well known path properties of diffusion processes, the proof of 
the exact convergence rate is rather demanding. As explained in Section^. 2 1 it is necessary to show 
the regularity properties of the estimated eigenfunction u\, which requires rather sophisticated 
arguments from the perturbation theory of differential operators with non-smooth coefficients. 
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1.3 Low-frequency convergence rate 


In the low-frequency regime, we need to threshold the estimator in order to ensure integrability 
and stability against large stochastic errors. As expected, <r| achieves the same mean L 2 rate as 
the original Gobet-Hoffmann and Reifi estimator. Furthermore, for a £ H 1 , this rate is minimax 
optimal, which can be obtained by the same proof as [111, Theorem 2.5]. 


Theorem 9. Grant Assumptions Q] and[ H Fix A > 0 and 0 < a < b < 1. Choosing J 
holds 


sup E CT) 6 

(a,b)GQ(d,D) 


tTg A D — a 2 


2 

L 2 {[a,b]) 


<N~v. 


Ns, it 


The general idea of the proof is the same as in Gobet et al. 0 or Q. We use the mixing 
property of the process A' to control the approximation error of the stationary measure /i by 
the empirical measure p n, see Corollary [45] Then, as discussed in Section f2. 11 we bound the 
estimation error of («i, u\) - the first nontrivial eigenpair of the transition operator Pa, obtaining 


|/?i - «i| + \\ui - ui\\ H i = O p (N 1/5 ). 


Finally, we bound the plug-in error of the spectral estimator as- A tenuous point is in that the 
estimator u\ converges to the eigenfunction u\ in the sense of mean H 1 norm only, hence we can 
not postulate a uniform positive lower bound on infa; £ r 0 w u'^x). Following Chorowski and Trabs 
fs], we are able to overcome this difficulty by applying the threshold a 2 s A D. 


2 Discussion 

2.1 Connection to the GHR low-frequency estimator 

In this section, we explain the relation between the defined estimator as above and the original 
spectral estimator introduced in 11, Section 3.2]. First, let us review the construction of the GHR 
estimator. 


Definition 10. As in Gobet et al. [ll|, Eq. 3.8] for u,v £ Vj let 


JV-l 


P{ u , v ) = ^ ^2 ( u ( X nA)v(X (n+1)A ) +v(X nA )u{X {n+1)A )). 


n =0 

A crucial observation is that, due to the appropriate weighting of the empirical measure, p 
becomes a linear combination of l and g. Indeed, using the summation by parts formula, we 
obtain 

( 4 ) 

Hence, for ( 7 i-,Ui)- any solution of the Eigenproblem [5j we have 

p{ui,v) = (1 - A ji)g(ui,v) for every v £ Vj. 

Denote 


(5) 


/?, = (!- Ay;). 


( 6 ) 


We conclude that the eigenpair (Tci, ui) is equal to the estimator of the eigenpair of the transi¬ 
tion operator which is defined in [11, Eq 3.11]. Taking into account that functions (tp-j) are not 
orthonormal, following [ll|, Eq. 3.12 and Eq. 3.7], we define the GHR estimator as: 

Definition 11. 


a GHRi x ) ~ 


2C1/0 ui(y)fi N (dy) 

(x)j2(x) 
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where 


£ = VaA with (fo)j = ([ [ i>i(y)i>j(y)dy] ) (f ipi(x)fi N (dx)) , 

j=0 xl J o -A' o 

is an estimator of the stationary density. 

Note that estimator as can be seen as a local average of a GHR . Indeed, since lj = 
integrating by parts gives us 




’S,j 


2Ci fo 4>j(x)(fo uiiy^Nidy^dx JA a 2 GHR {x)u' l {x)y{x)dx 


/A u[(x)'j2 N (dx) 


_ [?-1 u' 1 (x)12 N (dx) 


( 7 ) 


Since we focus on volatility functions in H 1 , the above averaging has no effect on the low-frequency 
convergence rate. On the other hand, there are multiple reasons why it is beneficial for optimality 
in the high-frequency regime. Firstly, since u\ is constant on every interval y], after aver¬ 

aging we do not have to estimate the density of the occupation measure (which is not regular in 
the high-frequency setting), but the occupation measure of the intervals y]. Furthermore, 

averaging reduces the variance of the estimator, which can be clearly seen in Figure |1] The intu¬ 
itive explanation of this phenomenon is that while the original estimator & GHR inherits the rough 
behaviour of the occupation density (via the inverse of the derivative of the eigenfunction u\ which 

has the same smoothness as the design density) this irregularity is removed by multiplication with 
—/ 

u^. 


2.2 Connection to the Florens-Zmirou estimator 


The general idea of the proof of the high-frequency convergence rate is to compare estimator 
as with the minimax optimal (see j~Ml . Proposition 2]) high-frequency estimator introduced in 
Florens-Zmirou 0. In this section, we recall the definition of the Florens-Zmirou estimator and 
discuss its relation to as- 

Definition 12. Define the time-symmetric version of the well known Nadaraya-Watson type 
estimator of the squared volatility coefficient, introduced in Florens-Zmirou 0, by 

J2n=0^j( X riA) + 1j(A"+1)a))PA+1)A ~ A) 2 

+ lj(An+l)A)) 

J 

a Fz,jij{ x )- 

j=i 


^2 _ 
a FZ,j ~ 

a Fz { x ) = 


Note that the underweighting of the first and last observation in the denominator of a RZ • appears 
naturally as an artifact of the time symmetry. 

Remark 13. We call a RZ a tinre-symmetrized version of the Florens-Zmirou estimator, since it 
is an average of the standard Florens-Zmirou estimators (c.f. 0 Eq. (1.1)]) constructed for the 
process (A,0 <t<T) and the time reversed process Y t = Xx-t- Indeed, let 


AAACA+da - X nA j 2 

A^AA) + En=l li(AnA) + IAAva)) ' 


<7?(A, Xa, ..., Ava) = 


( 8 ) 


Then 


<Xj(Xo , Aa, ■ Ava) + (To,F a, •••) Yna) 

a FZ,j = -o-■ 


Since stationary scalar diffusions are reversible, under the Assumption [51 the process (Y t ,0 < t < 
T) is identical in law to (A,0 < t < T). Hence, the statistical properties of estimator a RZ are 
the same as those of the classical Florens-Zmirou estimator. 
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Recall that ( 71 , ui) is an eigenpair of the Eigenproblem [5] From Definition [G] of the spectral 
estimator, it follows that 


^2 Ci 2l(ui,ipj) 

a s,j = 


71 "“1 ,3 57 -1 m{dx) 


(9) 


A similar representation formula can be established for the time symmetric Florens-Zmirou esti¬ 
mator dp Z . 

Definition 14. Define a bilinear form / : Vj x Vj —> R by 


^ 1 f 1 

f(u,v) = - J u\x)v'{x)d 2 FZ (x)jl N {dx). 
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Consider vector such that Vj ^ 0 for every j = 1,J and the associated function 

v G Vj. We have 


— 

a FZ,j ~ 


3_ 

v o 57 -1 m(dx) 


( 10 ) 


As will be thoroughly explained in Section HOI when A — > 0 , the eigenvalue ratio — C 1/71 i n ® 
tends to 1. Consequently, the difference between estimators a 2 s and &5z is controlled by 


3 

57-i m(dx) 


(ii) 


The main observation is that in the high-frequency analysis, we do not have to control the es¬ 
timation error of the derivative u' l . Indeed, to bound (IIII, we need only to show a uniform 
lower bound for U\j and an upper bound for the difference \l(ui,ipj) — Unfortunately, 

\l(v, tpj) — f(v, % } i)\ is not small enough for any bounded function v. To achieve the required upper 
bound for the estimated eigenfunction, we need to first obtain some regularity properties of u \, 
which is the most difficult part of the high-frequency analysis. 


2.3 A Numerical Example 

In this section, we present the numerical results for the volatility estimation across different ob¬ 
servation time scales. We compare three estimation methods: the time symmetric Florens-Zmirou 
estimator ( see Definition IT2)l . the spectral estimator ctq HR (see Definition [Til c.f. Gobet et al. 

0 Section 3.2]) with approximation space Vj of linear splines with equidistant knots, and finally, 
the locally averaged spectral estimator a 2 s . We apply an oracle choice of the projection level J, 
minimizing the risk. 

We compare the locally averaged spectral estimator with benchmark estimators &5z an( l 
ctqhr in both high and low-frequency regimes. Following Chorowski and Trabs Section 5] 
we consider diffusion process X with mean reverting drift b(x) = 0.2 — 0.4x, quadratic squared 
volatility function a 2 (x) = 0.4 — (x — 0.5) 2 , and two reflecting barriers at 0 and 1. This choice of 
diffusion coefficients is supposed to minimize the reflection effect alongside with some variability in 
the volatility function. Nevertheless, the depicted behaviour is typical for other diffusion processes. 
The sample paths were generated using the Euler-Maruyama scheme with time step size A/100 A 
0.001 with reflection after each step. All simulated paths were conditioned to have an occupation 
time density greater than v = 0.2. Table |T] presents the oracle mean T 1 ([0.1,0.9]) estimation error 
of a 2 , obtained by a Monte Carlo simulation with 1000 iterations, in high (T = 5, A —► 0) and 
low (A = 0.25, T —> 00 ) frequency regimes, respectively. The estimated volatility functions for 20 
independent paths are depicted in Figure [T] 

In the case of high-frequency observations, d 2 s performs similarly to the benchmark estimator 
a^z- Relative to ||cr 2 1|z.i([o, 1 , 0 . 9 ]) ~ 0.28, the error decreases from approximately 6% for A = 
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High Frequency, A = 0.0001, T = 5 

Locally averaged spectral estimator 



Low Frequency, A = 0.25, T = 12500 

Locally averaged spectral estimator 

' ^ I 

0.4 

i _i 

W \ 

0.2 

0 




Time symmetric Florens-Zmirou estimator 


Time symmetric Florens-Zmirou estimator 



Gobet, Floffmann and ReiB spectral estimator 


Gobet, Hoffmann and ReiB spectral estimator 



Figure 1: Estimated volatility functions for 20 independent trajectories. 


10~ 3 to 3% for A = 10“ 4 . The estimation error of spectral estimator a GHR is almost twice 
as large, although the quality of the estimation improves when A decreases. It is important 
to note that the oracle values of space parameter J for &q HR are much bigger than those for 
other estimation methods. When A is small, the eigenfunctions inherit the regularity of the local 
time; the increase in dimension compensates for the projection error. Due to local averaging, 
this irregularity problem does not appear for o R , compare with Figure [H where estimator u^q HR 
oscillates heavily. Furthermore, there is no visible boundary effect, suggesting that the error rate 
of the spectral estimator does not deteriorate outside the fixed interval [0.1, 0.9]. 

In the low-frequency regime, a 2 s performs slightly better than the original spectral estimator 
&qhr- The boundary problem is visible, especially for o 2 GHR . The relative error decreases from 
12% for T=1000 to 5% for T=30 000. The Florens-Zmirou estimator a RZ underestimates the 
volatility and commits a relative error of 30% . This is expected and due mostly to the boundary 
reflection, which, for low-frequency observations, is not negligible in the interior of the state 
space. As found by unreported simulations, in the case of low-frequency observations, the locally 
averaged spectral estimator a 2 s will outperform the Florens-Zmirou estimator in the case of a 
highly varying volatility function er 2 , even when the sampling frequency is big enough to ignore 
the reflection effect. 

2.4 Extensions and limitations 

Stationarity of process X. In the high-frequency analysis the stationarity assumption ensures 
that process X is time reversible. General initial distributions could be considered, but in order 
to preserve the performance of the estimation for the time reversed process, the coefficients of the 
backward process must belong to the nonparametric family 0. 

Due to the spectral gap of the generator, process X is geometrically ergodic. In particular, 
as t —> oo, the one dimensional distributions of X t converge exponentially fast to the invariant 
measure p. It follows that, in the low-frequency regime, the assumption of stationarity can be 
made without loss of generality for asymptotic results. 

Estimation at the boundaries. In the high-frequency regime, we prove the error bound in 
the interior of the state space. Restriction to the interval (a, b ) allows us to obtain uniform lower 























High-Frequency Regime: T = 5 



A = 0.001 

A = 0.00075 

A = 0.0005 

A = 0.00035 

A = 0.0002 

A = 0.0001 

~2 

17 GHR 

0.0388(^g) 

0.0353(23) 

0.0322( 24 ) 

0.0292( 32 ) 

0.0258(3 6 ) 

0.0220( 49 ) 

*a 

0.0195( 9 ) 

0.0174 (10) 

0.0149( 10 ) 

0.0131( 12 ) 

0.0108( 13 ) 

0.0088(ig) 

S FZ 

°-0169( 10 ) 

0.0153( 11 ) 

0.0133( 12 ) 

0.0119( 12 ) 

0.0100( 13 ) 

0 0080( 2 q) 


Low-Frequency Regime:A = 0.25 



T=lk 

T = 3k 

T=7k 

T= 10k 

T=15k 

T=20k 

~2 

17 GHR 

0.0386( 5 ) 

0.0333(0) 

0.0256( 11 ) 

0.0226( 1;L ) 

0.0198( 11 ) 

0.0178 (11) 

s s 

0.0310( 4 ) 

0.0245(g) 

0 . 0200 ( 7 ) 

0.0182 (g) 

0.0166(g) 

0.0155(9) 

°FZ 

0.0821(g) 

0.0823( 5 ) 

0.0823( 5 ) 

0 . 0822 ( 5 ) 

0.0823( 5 ) 

0.0824 (5) 


Table 1: Monte Carlo estimation errors in high and low-frequency regimes. The value of parameter 
J is given in the subscript. 


bounds on the derivative of eigenfunction u\, which, due to boundary conditions, are not valid 
in the entire state space. This restriction could be omitted by obtaining uniform bounds on the 
ratio of derivatives Ui.j±i/ui.j. Unfortunately, since our proof relies on a posteriori error bounds 
on solutions for perturbed eigenvalue problems, we do not have the sufficient tools to control the 
pointwise relative error of the eigenfunctions. Nevertheless, the numerical results suggest that the 
spectral estimation procedure also behaves well at the boundaries of the state space. 

In the low-frequency regime, the spectral estimator is unstable at the boundary due to Neu¬ 
mann boundary conditions for the eigenfunctions of the infinitesimal generator. Refer to 11 
Section 3.3.8] for a discussion of the boundary problem. 


Boundary reflection. Following previous works on the spectral estimation in the low frequency 
setting, e.g. jTl , 23, Bj], we consider an Ito diffusion model on the state space [0,1] with instanta¬ 
neous reflection at the boundaries. The assumption of a compact state space makes the construc¬ 
tion of the estimator easier and facilitates error analysis in the low-frequency setting, c.f. Reifi 
[24]. We point out, here, that the reflection assumption is not restrictive in the high-frequency 
setting. Indeed, consider diffusion X defined on the entire real line with drift b and volatility a. 
Let 


A(t) — j l [0 ,i]{X s )ds 
Jo 


be the occupation time of interval [0,1]. Assume that linp^oo A(t) = oo and define the right- 
continuous inverse 


C(t ) = inf{s > 0|A(s) > t}. 


Process Y t = Xc(t) follows the law of a reflected diffusion on [0,1] with drift b and volatility o. 
Assume now the given observations A'o, Aa, ..., A^a- The sub-sequence of the values that lie in 
[0,1] forms a chain of observation of Y. The sampling frequency is random (and depends on the 
path), but when A shrinks, it becomes close to equidistant. The difficulty in handling irregularities 
at the boundaries is similar to these found when considering the reflection effect. Unfortunately, 
while this reduction can be used under the assumption that A is small, it can’t be applied in the 
low frequency setting, hence it is not practical in the context of scale invariant estimation. 


Linear spline basis. The use of the linear spline basis is very convenient, as functions ifjj 
appear naturally after applying integration by parts to the locally averaged GHR estimator, see 
©• Nevertheless, unreported simulations suggest that the spectral estimation method performs 
as well with other bases. The Fourier cosines basis in [0,1] is especially efficient, consisting of the 
eigenfunctions of the reflected Brownian motion process. 
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Adaptivity. An important decision in the spectral estimation is the choice of the basis dimension 
J. The general problem is twofold: dimension J should adapt to the smoothness of the coefficients 
and simultaneously to the observation frequency. In [§} the authors applied Lepski’s method 
to construct a data-driven version of the GHR estimator that adapts to the smoothness of the 
volatility. In the case of the low frequency data, the same selection rule can be applied for the 
universal estimator as- The precise construction of a method that will adapt to the observation 
frequency remains open. 

The numerical study shows that the proposed estimator as smoothly interpolates between the 
high and low-frequency estimators. The optimal convergence rates in both frequency regimes leave 
out the question of the paradigm to use when one has to consider data. The different convergence 
rates in high and low frequency regimes raise the question of bivariate asymptotics with respect 
to both A and T. Nevertheless, because of the structural differences of the high and low-frequency 
data, we believe that such an analysis would be particularly challenging. 


3 High-frequency analysis 

We will write / < g (resp. g > /) when f < C • g for some universal constant C > 0. / ~ g is 
equivalent to / < g and g < /. 

The proof of Theorem [S] is presented in Section 13.51 and is accomplished in several steps. 
In Section 13.31 we prove the convergence rate of the time-symmetric Florens-Zmirou estimator. 
Section 13.41 is devoted to the proof of Proposition [2D] - the uniform bounds on the estimated 
eigenpair ( 71 , u\). In Section 13.61 we prove some technical results on the crossing intensity of the 
diffusion processes. 


3.1 Preliminaries 

From now on we take the Assumptions [T] and [2] as granted. Fix 0 < a < b < 1 and the level v > 0. 
For simplicity, set T = 1. Let J ~ A -1 / 3 . 

Sobolev regularity of the volatility implies that it is 1/2—Holder continuous. Indeed, by the 
Cauchy-Schwarz inequality it holds 


sup 

rc,ye[0,l] 


W 0 ) - °(y)\ 

\x-y\ 1/2 


I f v a'(z)dz 
him 1 1 

*,„e[o,i] \x~y \ 1/2 


< IMki. 


( 12 ) 


Recall Definition [7] of the occupation density yr- Formula ©, together with (f!21) . imply that yx 
inherits the regularity properties of the local time. In particular 

Theorem 15. The function y\ is almost surely Holder continuous of order a for every a < 1/2. 
Moreover, for every p > 1, we have 


sup E CTi b sup Pi(x) 

(a,6)G© *-a;E[0,l] 


sup E a ^[\yi(x) - Hi{y)\ 2p ] 

(ct,6 )G© 


< oo. 

(13) 

< Cp\x-y\ p . 

(14) 


Proof. Since a is uniformly bounded and 1/2—Holder continuous, the claim of the theorem can 
be deduced from the well known properties of the family of the local times ( L t ,t > 0 ) of the 
semimartingale X , see the proof of |25j, Chapter VI, Theorem 1.7] and the subsequent remark. □ 


Definition 16. Denote by oj the modulus of continuity of the path (A' t ,0 < t < 1), i.e. 


oj(S) = sup \X t -X s \. 

0<s,t<l, |£—s|<<5 
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Because of the ellipticity assumption a > 0 the path (X t ,0 < t < 1) shares the properties of 
Brownian paths. In particular, we can apply the Brownian upper bounds (see Fischer and Nappo 
0 ) on the moments of w: 

Theorem 17. For every p > 1 there exists a constant C p > 0 such that 

sup E (T! b[u; p (A)] < C p A p / 2 ln p (A -1 ) . (15) 

(o-,b)6© 


The proof of Theorem [T71 is postponed to Section [A] Using (fT5l) we can show that on C v the 
occupation measure p n is spread uniformly on [0,1] with high probability. 

Lemma 18. Let 

lh=C v n {w(A)||^ 1 || 00 < A 5/11 u}. (16) 

For A sufficiently small we have 

Pcr,b(£tJ \ Ltl) < A 2 / 3 . 

Furthermore, on the event 1Z\, for every 1 < j < J, we have 


v < J 


PN(dx) < ||/ii|| c 


The proof of Lemma [TH] is postponed to Section 13.31 

As mentioned in Section fl.2l we want to compare the spectral estimator cr| with the benchmark 
high-frequency estimator ctp Z . Before that, we have to prove a uniform upper bound on the mean 
L 2 error of the time symmetric Florens-Zmirou estimator. The result below is a generalization 
of Q Proposition 2], where the same rate was obtained under the assumptions of smooth drift 


and Lipschitz volatility. As proved in 14, Proposition 2] the rate A 1 / 3 is optimal in the minimax 


sense even on the class of diffusions with Lipschitz volatility. 

Theorem 19. Grant Assumptions\7\ and [ H Fix T > 0 and choose J ~ A~i. We have 


sup 

(<7,6)e0(d,D) 


• ||< 


FZ 


\L 2 [1/J,1-1/J] 


]' < A 1/3 . 


(17) 


Because of the reflection, the rate deteriorates at the boundary. For x G [0,1 / J\ U [1 — 1/J, 1] 

sup E^bfl-R.! • \ap Z {x) - cr 2 (x)| 2 ] 3 < A 1 / 33 . (18) 

(a,b)ee(d,D) 

The proof of Theorem [HI] is postponed to Section 13.31 The main idea is the decomposition of 
the error into a martingale and deterministic approximation parts as in [Til . Proposition 2 ]. As 
expected, under the high-frequency assumption, the reflection has an effect only at the boundary. 
Inequalities m ctnd (USD imply 

sup E CT , 6 [l TCl • II a 2 p Z (x) - cr 2 (x)|| L i [0il] ] < A 1/3 . 

(<J,b)eO(d,D) 


3.2 Outline of the proof of the high-frequency convergence rate 

Since by Theorem 17171 the estimator Op Z attains the optimal rate A 1 / 3 , to prove Theorem [8] it is 
enough to upper bound the mean Zv 1 [0,1] error between & 2 FZ and d%. Using representations (f?|) 
and (fll)l) af z — a 2 s can be reduced to the difference of the forms / and l (c.f. Lemma ET5l) . First, 
we need however to list the properties of the eigenpair (Ci,ui). The proof of the next Proposition 
is postponed to Section [3~T1 
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Proposition 20. Let 0 < a < 6 < 1 be fixed. For every e > 0 there exists an event IZ 2 = 
with P a,b{j^v \ F- 2 ) < e, and a constant C = C(e), such that, for A sufficiently small we have 

hz 2 • |7i| < C. (19) 

Furthermore, the eigenfunction ui can be chosen such that on IZ 2 

J J 

j = J and Uij ~ 1, and u 2 ^< 0) < 1 

1 =1 l=i 

hold for any j = \aJ\ — 1, \bj~\ + 1 . 

Remark 21. The normalization V'.j—j U?j = J is natural, as it is equivalent to ||ui||z ,2 = 1 . In 
short, Proposition [201 states the existence of uniform bounds on u\ |r a ,b]- Because of the Neumann 
boundary conditions on the generator, the separation from the boundary is necessary for the 
existence of the lower bound. 

Remark 22. From the general inequality 


|1 + log(l — x)/x\ < x, 0 < x < 1 / 2 , 


together with the uniform bound (1191) on the eigenvalue 71, we deduce that, on the high probability 
event 7 ?- 2 , |1 + C1/71I 1 $ A holds. Consequently, the eigenvalue ratio — C1/71 in © is of no 
importance in the high-frequency analysis. 

Definition 23. Define 


a h 


a 2 s {x) 


2l\u 1 ,fij) 

^ j 

u\,j S7-! m{dx) 
j 

1=1 


( 20 ) 


For simplicity, we will refer from now on to erg as to the spectral estimator. Comparing the 
representations m and m we obtain 


1 - 1,1 


~2 , 2|i(ui,^j) — f(ui, 1 pj)\ 

a FZ,j I = 


Ulj f /-I flN^dx) 


Since by Proposition [201 the derivative Uij has a uniform lower bound, Lemma ITTH implies that to 
show the convergence rate A 1 / 3 we have to prove that 


\l{ui,f>j) - = O p { A 2/3 ). 


As argued in Proposition 1301 for any function v € Vj with bounded derivative, it holds 

I Kv , il > j ) ~ = O p ( A 1/2 ), 

which leads to a suboptimal rate A 1 / 6 . In order to achieve the optimal rate A 1 / 3 we need to 
use the regularity of the first nontrivial eigenfunction u\. By the means of the Perron-Frobenius 
theory, in Proposition 1371 we prove that for some high probability event IZ 3 


E, 


er.fc 


l-R-3 


(4±4)-ai(4) 


\j 


J " 1 / 2 


< 1 


holds, which can be interpreted as the almost 1/2—Holder regularity of u\ (see Remark [551) . 
This regularity of the eigenfunction allows us to reduce the estimation error to an approximation 
problem of the occupation time, see decomposition (EZD and Lemma Ei 
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3.3 Proof of Theorem 1191 

We begin with the proof of Lemma 1151 

Proof of Lemma \18[ Note first that, on the event C v , we have 

v<J Mi (dx) < HmiIIoo- 


Using the occupation formula ([3]), we obtain that 


JV-l 


1 ^ N 1 An+ 1)A 

77 5Z T(^»a) - 1 Mi (x)dx |lj(X nA ) - lj(X s )|ds < 

n=0 J L ~r~ „—n J riA 


n—0 ' 


r( n + 1)A 

e/ a 

n=0 • /nA 

c 


< 


' nA 
-+u;(A) 


Ki^.-¥ 




pi(x)dz 


< uj(A))ds + l(|X s — ^| < w(A))^ 
4 +c(a) 


ds 


’t-u( A) 


Hi{x)dx < 4w(A)||mi||oo- 


Hence, and since J ~ A 1 / 3 , on the event 7\li 


A3 W < A3?; - 4w(A)||mi||oo < / MAf(da:) < (A3 + 4 w(A))||mi||oo < As 11^!!!^, 

liolds for any A < 1. Finally, to prove that 7?-i is a high probability event, note that for any p > 1, 
Theorem [T7] together with the inequality (fT51) imply 

Pa.bOAA^i) < A' 5 P/ 11 E CT , b [a;(A)P||Mi||S 0 ] 

< A^ 5 P/ 11 E (T , b [ w (A) 2 P] 1 / 2 E ff , 6 [llMillS] 1/2 

< A^ 5 p/ 11 A p/ 2 ln p/ 2 (A^ 1 ). 

We obtain the claim by choosing p > 15. □ 

Now, we are ready to prove Theorem[]j|J The main ideas are as in (Til . Proposition 2], The nov¬ 
elty consists on the direct treatment of the drift term and the analysis of the boundary behaviour, 
which is an artifact of the reflection. 


Proof of TheoremWA Set 1Z = TZ\. Recall the definition © and the discussion thereafter. It 
follows, that it is sufficient to prove the claim for ct 2 (Xo, Xa, ...,Xata)- 
Since 

1 N ~ X 1 f 4 

j(X°) + ^ 1 j(X„a) + -Ij(Xjva)) = / i ^N{dx), 

n=l J ‘ L l~ 

by Lemma 1181 on the event 1Z \, the denominator of ct 2 (X o, X a , X aa ) has a uniform lower 
bound of order A 1 / 3 . Hence, in order to prove HID, we have to show that, for any j = 2,..., J — 1 
and x £ j], we have 


AT-l 


E„ 


1-Ri • j ^2 1 i( X nA)({X {n+1)A - X nA f - Act 2 (a 


n—0 


Indeed, CD implies 


<AV 2 (yJ [(a 2 )'(i/)] 2 dy)VA 2 / 3 . 

( 21 ) 


j-i 


E, 


a,b 


L Ki 


l°>z - a ”lli 2 [i/j,i-i/j] 


< 


,3 + 1 
J 


Jt=i [(^ 2 ) , (2/)] 2 ^ + aI ) =Al(||a 2 || 2 ff i+l). 


i =2 
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Step 1. Error bound in the interior. Fix 2 < j < J — 1 and x £ [^-j—, j]. Note that on the 
event 7?.i the condition lj(X„A) = 1 implies that no reflection occurs for t £ [?rA, (n+l)A]. Using 
Ito formula we can decompose 


N-l 


'y ' lj(X n A)((X( n +i)A ~ A„a) 2 — Ac" (a;)) :— A\ + A 2 + A% + A 4 , 


n—Q 


where 


Ai = Em x «*) 


Ao = 


A* = 


n =0 
N—l 


n —0 


(n+l)A 


n(n+l)A 


A 4 = -2 


(X s )dW s Y - f 

' J n 

E lj(X n A) {a 2 {X s ) - a 2 {x))ds, 

n—0 JnA 

N-l r(.n.+ 1)A 2 

E li(^nA)( / &(*,)<*«) , 

£^0 WnA ' 

N—1 /*(n+l)A /*(n+l)A 

E h( X nA) / ff(X s )dlU s / 6(X s )d S . 

M _n •/ nA •/ nA 


cr 2 (X s )ds 


We will bound the second moment of each of the terms A 1 , A 4 . First, note that arguing as in 
the proof of Lemma [THl we obtain 


JV-l 


jy E ^'(^a) < (A 3 + 4w(A))||^i|| 00 . 


( 22 ) 


n =0 


Consequently, from the Cauchy-Schwarz inequality, together with Theorem [T7] and the inequality 
m follows that 

JV-l 


E 


<a,b 


(jv E ^(^a)) 


2-i 


n=0 


< A 3 . 


(23) 


Denote by JF n the cr—field generated by {X to a : 0 < m <n}. Let 


p(n+ 1)A 2 n n ~ l-ijii 

( / CT (X s )dW s ) - / a 2 (X s )ds. 

X JnA ' JnA 


n+l)A 


In = 


Since ( r/ n ) n are (X n )—martingale increments, they are conditionally uncorrelated. Using the 
Burkholder-Davies-Gundy inequality we obtain that ^ a .b[ r in\ X n} < A 2 . Consequently, 


N-l 


N-l 


E, 


= ( E E °Ah( X nA Wn\Y < A^E CT , b [l E M^nA)]* < Al, 


n=0 n=0 

where we used (1231) to obtain the last inequality. On the event 7 Zi, when lj (X ri A ) = 1, we have 
\X„ -x\< \X S - X„ A | + |X„ A -x\< w(A) + A 1 / 3 < A 1 / 3 . 

Using the Cauchy-Schwarz inequality we obtain that 

N-l /.(n-(-l)A r X a 1 (V-1 _ N 1/2 


A 2 A 


E M^a) / / (<J 2 Y(y)dy ds < — E lj (X nA )A 1/6 ( / [{a 2 )’(y)] 2 dyj 

n- 0 dnA 7® n =0 dij 3 


Hence 


E., 6 [l Wl • ^ < A 1 / 2 ( [ I [(a 2 n y )] 2 dy) 


1/2 
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The drift function b is uniformly bounded, hence |^ 3 1 < A. Denote 

Y t = [ a(X s )dW s andwr(A) = sup \Y t -Y„\. 

JO 0<s,t<l 


\t— s|<<5 


The uniform bound on b , together with |Y( n +i)A — L^aI < wy(A), and the inequality (1221) imply 


Ecr,b[l 1h ■ A%\ 2 < E CTib 

< E<t ,6 


N-l 


1 n 1 ■ 22 li(-Y„A)wr(A)^) 

n—0 

(AiH/nlloo^lA))' 


< A3, 


where we used uniform bounds on the moments of modulus of continuity of semimartingales with 
bounded coefficients (see Theorem [T71) . 

Step 2. Error bound at the boundaries. Set j = 1 (the case j = J follows analogously) and 
x £ [0,1/J], On 72-1, whenever X nA > A 5 / 11 , no reflection occurs for t £ [nA, (n + 1)A]. Denote 

li(x) = l(x < A 5 / 11 ) + 1(A 5 / 11 < x < J ~ l ) := li,o(x) + li,i(x). 


We decompose 


with 


JV-l 


22 ll(^nA)((^(n+l)A — X n /\) 2 — A(J 2 (x)) El + E 2 , 


n =0 


N-l 


El — ll,o(-XnA)((-X'(ra+ 1 )A ~ A.„,a ) 2 — Aq 2 (x)), 

n—0 
N-l 

E 2 = 22 ll,l(^A)((X (n+ i)A - X„a) 2 - Aa 2 (x)). 


n—0 


On 1Zi holds |(X( n+1 )A — A„a) 2 — Acr 2 (x)| < A 10 ' 11 . Hence, arguing as in the proof of Lemma 
ITHl we obtain that 


AT-l 


E„ 


1 -R.! • E x 


<A nE CT , b 


<A nE-i 


Ki'(n 22 1 l, 0 (-^nA)) 

n =0 

n-l ' (yJ Q li,o(x)hi(x)dx + 4w(A)||/xi||co) 


< Att. 


To bound the second moment of E 2 , note that when 1| ,1 (Wj.a) = 1 no reflection occurs for 
t £ [nA, (n + 1)A]. Consequently, we can proceed as in Step 1, obtaining 


E<r ,b 

We conclude that 


1 Hi ■ E 2 


i < A l/ \j\^ 2 )\y)] 2 dy) 1 + a 2 / 3 < a 1 / 2 ||cr 2 1| 


H 1 - 


E, 


(7,6 


• I Ofz(x) - < j2 ( x )\' 2 


< A" 1/3 E. 


a,b 


Iki • (Ei + E 2 y 


< A 1 / 33 . 


□ 


Corollary 24. For every e > 0 and A sufficiently small, there exists an event TZ = TZ(e) C TZ \, 
with P a ,b(E v \ TV) < e, such that on 1Z 

ap Z ( x ) ~ 1 f or every x £ [0,1]. (24) 
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Proof. From Theorem [Hi] follows that 

E CT ,b[ 1 -R.i • W&FZ ~ 0-2 Hz, 2 [ 0 , 1 ]] ~ A 1//3+2//33 . 


Set e > 0. Let 

TZ = TZ 1 n{\\a 2 FZ -a 2 \\ 2 L2 < ( 2 J ) _1 inf cr 4 (z)}. 

rce[0,l] 


From Markov’s inequality, together with the lower bound on the probability of the event 7 Z±, 
follows that 


F atb (C v \TZ) < A 2 / 33 . 


Hence, for A sufficiently small, we have P <Ti b(£ 1 ,\7?.) > 1 —e. Since \\&Fz~ a2 \\oo — — ct 2 IIi , 2 

we conclude that on 1Z holds a FZ ~ 1. □ 


3.4 Properties of the eigenpair ( 7 !, in) 

In this section we want to prove Proposition [50] Because of the tridiagonal structure of the form 
l, the direct analysis of the eigenfunction u\ is difficult. Instead, we consider the generalized 
eigenvalue problem for forms / (recall Definition 11411 and g\ 

Eigenproblem 25. Find (A ,w) tlx Vj , with «i/0, such that 

f(w,v) = Xg(w,v) for every function v £ Vj. 

On the high probability event 7?. 2 C IZi such that a FZ ~ 1 (see Corollary [Mil . the form / 
is positive-definite and symmetric. Consequently, on 72-2, the Eigenproblem [25] has J solutions 
(A with 0 < Ai < A 2 < ... < A j. 

Definition 26. For j = 1,..., J define if® = f)j — ifj( x)/2jv( dx) € Vj . Let 
Ki : = fO’i’4>°) = fii’i, 4>j) and M itj = g{ip°, 


be the matrix representations of forms / and <7 on Vj x Vj with respect to the algebraic basis 


Arguing as in Gobet et al. 1 1 |, Lemma 6.1] we obtain that 


M, 


hj 



fiN{dx)dydz. 


'yVz 


(25) 


F is a diagonal matrix with strictly positive diagonal entries, hence it is invertible. Eigenproblem 
[25] is equivalent to 

F M(wij)j = X i (Wij)j , 

where indicates the coefficient vector associated to the eigenfunction Wi, i.e. 


J J 

j =1 3 =1 

with some i /70 such that Wi S Vj. Since the matrix M has all entries strictly positive, the matrix 
F~ X M satisfies the conditions of the Perron-Frobenius theorem. Consequently, the eigenvector 
(wi.j)j can be chosen strictly positive, which corresponds to the monotonicity property of the 
eigenfunction wi. In what follows, we will show that the Eigenproblem [55] is an approximation of 
the Eigenproblem [3] for forms l and g , and deduce that the eigenfunction ui inherits the properties 
of w 1 . Let || • ||j 2 denote the standard Euclidean norm on 
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Definition 27. Set 0 < a < 1/42. Denote by TZ a the set of paths contained in C v such that 

(i) w(A) < A 1 / 2 ' 0 

(ii) for every x £ (0,1) holds a FZ (x) ~ 1 

(iii) II °fz ~ cr2 IU 2 ([>,&]) < A 1/3 "“ 

(iv) occupation density is 1/2 — a Holder continuous with Holder norm bounded by a -1 


Remark 28. By Theorem IT71 Corollary [Ml and the regularity properties of the occupation density 
H i, for every e > 0, there exists a = a(e) such that, for A sufficiently small, P cr .b(£„ \ lZ a ) < e 

(fT51) imply that on the event lZ a 


holds. The assumption (iii) and Holder regularity of a 2 

I &fz( x ) ~ a2 ( x )\ < A 1 / 6 ^ 0 for all x £ [a, b\. 


(26) 


By the assumption (iv) we have ||tti||oo 1- Furthermore, arguing as in the proof of Lemma 1181 
we obtain 


R N (dx) — / pi(dx) < w(A)||/Lti|| 00 < A 1/2 “. 


(27) 


In particular, on lZ a 



A 1 / 3 holds for every j = 1,..., J. 


(28) 


To bound the error between the solutions of the Eieenproblems 151 and Pol we need to establish 
uniform bounds on the spectral gap of the Eigenproblem 1251 

Lemma 29. On the event lZ a the eigenvalue X\ is uniformly bounded. Furthermore, the Eigen- 
vroblem \25\ has a uniform spectral gap, i.e. A/ 1 — A^ 1 > 1. 


Proof. Consider the generalized eigenvalue problem: 

Eigenproblem 30. Find (A, w) £ 1 x Vj with w ^ 0 and f Q w(x)n\(x)dx = 0 such that 


/ w'(x)v'(x)a 2 (x)p,i(x)dx = A / w(x)v(x)pi(x)dx, 

Jo Jo 

for all v £ {v £ Vj : / v(x)/i±(x)dx = 0}. 
Jo 


Eigenproblem DHl has J solutions, denoted by (A j,Wj) with 0 < Ai < A 2 < ... < A j. By 
Proposition [50] we have Ai ~ 1 and A/ 1 — A./ 1 > 1. Let M,F be J x J matrices corresponding 
to the Eigenproblem [501 tested with functions ('/’}) where i/d = ifj — ipjfj,i(x)dx. As in 

the case of the data driven Eigenproblem [55] we have 


Mij — 


F — 

r i,3 ~ 


r 7 ri ry Az r 1 

/ / / pL\(x)dx / fii(x)dxdydz, 

jiA j ^- Jo j uVz 


' yvz 

"0 -i^j 

ff-icr 2 [x)n 1 (x)dx :i = j 


and 


F M(wij)j — X i ( Wij)j. 

From Weyl’s theorem for symmetric eigenvalue problems follows that 


(29) 
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We will show that \\F~ 1 M — F~ 1 M\\p < A 1 / 6_ “. Then, the uniform bound on the eigenvalue Ai 
and the lower bound on the spectral gap will follow from the properties of the Eigenproblem 157)1 
First, let us observe that by (1271) and (l26l) . on lZ a , for any j = 1 ,J we have 


I Fj,j Fjj | — 




&fz ifiN(dx) - / a 2 (x)pi(x)dx\ 


< 


<ap Z j\ I m{dx) - j ni(x)dx\ + j i \d 2 FZ (x) - a 2 (x)\^i(x)dx < A 1/2 


In particular Fj,j,Fjj ~ A 1 / 3 . Arguing as in the proof of Lemma 1181 for any i,j = 1J, we 
obtain 

| M i}j - M itj | < J- 2 W (A)|| Ml |U < A 7 / 6 ” 0 . 

Furthermore Mj j, Mij < A 2 / 3 . Since F and F are diagonal matrices, it follows that 

|(F -1 M - F~ 1 M) i j\ < A 1/2 -“. 

Hence, - F _ 1 M || 2 2 < ^F^M - F~ 1 M ) 2 j < A 1 / 3-2 ". □ 

Proposition 31. Choose the eigenfunction w\ increasing and normalized so that || (tZ?i j)j ||i a = 
J 1 / 2 (i.e. Il^iHi 2 = lj- On the event lZ a , for any \aJ\ — 1 < j < \bJ~\ + 1 and any i = 1,..., J 
we have 

1 V v)i : i < Wij A 1, (30) 

Furthermore for j s.t. J 1 / 2 < j < J — J 1 / 2 


Wl,j±l 

uh,j 


1 < A 1/6_ “. 


(31) 


Proof. In the proof we will use standard techniques from the Perron-Frobenius theory of nonneg¬ 
ative matrices (cf. Mine |20j, Chapter II]). In particular, we shall repeatedly use the following 
inequality Mine (2d|, Chapter II, Section 2.1, Eq. (7)]: for any q\ , q 2 ,..., q n >0 and p\ , p 2 , ■■■,p n G K- 


Pi , Pi +P2 + +Pn 

min — < - 

i=l,-,nqi qi+q 2 + -- + q n 


< 


Pi 

max —. 
i=l,...,n qi 


(32) 


Step 1: (w\ } i < 1). Fix 1 < i < J. By Definition I27j(ii)[ relation (1281) and Lemma 1251 on the 
event lZ a , we have 


r-j ^ ^ ^ ^ 

~ wx ti d 2 FZi pn{x) = f(wi,ifi) = Xig(wi,ipi) ~ g(w 1 ,i/j. i ) = ^ M itm w ltrn . (33) 

^ ^ m= 1 


Hence, by the Cauchy-Schwarz inequality 


Wl,i < -/( ^ M lm) 7 ( J2 ™bm) ' ~ 


1/2 


where we used < J ~ 2 and the normalization of {wi j). 

Step 2: (wi t i < w\ t j). Fix |_<zJ r J — 1 < j < \bJ] + 1. On the event TZ a , for any 1 < i < J the 
relation (1551) together with the inequality (1551) imply 


®l,i Era=l M i Mi 

,m 

—— ~ _ _ < max — 

W i n V' J M ■ irh m=l,...,J M ■ 

/ jjn — i lvl j,m uJl,m lvl j,m 


(34) 
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We need to show that for arbitrary to Mp m /Mj, m < 1 holds. Consider first the case i < j. Then 
by ( 1551 ) 


M ltm 

Mj,m 


fZ 1 fZi C Z m{dx) fy Vz fi N (dx)dydz 


< 


IZi fZ 1 fo AZ MN(dx) fy Vz il N (dx)dydz 
fZi fZi m(dx) fy V i -1 p N (dx)dydz 

IZi f Zi fo A ~ m(dx) fy V j_ p N {dx)dydz 
fZi f(y) Lvi -1 fii\{dx)dy 


fZ1 f{y) f yV $ V'N{dx)dy 


where f(y) = f 0 yA J p N (dx). Consider m > j. For y £ [yy^, y] holds y = = yV 1 -y, hence 

the numerator and denominator are equal. Consider to < j. For y £ [yy-, y] holds y V y = y 
Hence, using (1251) . we obtain 


fZi f{y) fLi -1 p N (dx)dy 


Mj, m < j \yj j y y 

Mj,m ~ 


< 


fZ 1 f(y)dy 


fZi f{y) f! v i p N {dx)dy / m J _i f(y)dyjj p N {dx) 

j * j j j 


y N {dx)^ ~ (l - j) < 1. 


We conclude that for i < j and arbitrary m bound Mi tTn /Mj tm < 1 holds. Proceeding analogously, 
we obtain the same claim for i > j. From ( 1511 ) follows that on the event lZ a , for [aJ J — 1 < j < 
\bJ~\ + 1 and any 1 < * < J , we have 

(35) 

Step 3: (1 < W\ t j). Let Wij 0 = min^ Q jj_ 1 <J <p b j ] +1 Wij. Inequality ( 1551 ) implies 


1 = 


2—1 


Step 4 : (v roo f °f H31\) ). We will only show | w ~f +1 — l| < A 1 / 6 “, the other bound can be 
obtained by a symmetric argument. First, note that from Definition I27|(iv)| together with the 
inequality ( 1571 ) follows that 


/; 


p N {dx) - / pN(dx) 


< 


Pw(dx) - 


pi(x)(dx) 


/; 

T 


pi{x){dx) — / pi{x){dx) 


+ 


pi (x)(dx) - / pN^dx) 


< ^l/2-a + A l/3 A (l/2-a)/3 + A l/2-a < A l/2-a. 


Hence, by 


JT 1 

J? 5- VNidx) 


- 1 


< a 1/6_ “. 


! 7 - 1 Mw(da;) 

Similarly, by the 1/2—Holder regularity of <r 2 and Definition I27((ii)| together with ( 1551 ) we have 


a FZ,j+l 


- 1 




< a 1/6- “. 
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Consequently, instead of w ~^ +1 we may consider 


-i+i 

Wl,j+lV FZ ,j+l S~£ m(dx) y j 


Em=1 


Wl,j&FZ,j f /~i P'N(dx) Em=l Mj tm Wi t r 


By the inequality (1521) 


ddj'+l.m ^ E0m=l ^dj+l,mWl,r, 


]\/[, 


J,m Em= 1 Mj !m W\ t r 


< max 


Mi 




A/f . 


Thus, it is enough to show, that for any m = 1,..., J bound 




M, 


- 1 


< A 1 /® holds. 


M. 


Mj,m 


I7~ fZ=i C Z m(dx) f* Vz fhv(dx)dydz 


y\/z ‘ 


fl -1 iXi /(T* Miv(da;) Lv, h N (dx)dydz 


y\Jz I 


< 


< 


fx fZ- i f7 A ^~ m(dx ) fi N (dx)dydz 

fZi [Z i / 0 yA ~ Miv(dx) p, N (dx)dydz 


tt ry A 


&/c 


0 
j+i 


jd N {dx)f(y)dy 


= 1 + 


,fm-i fi N (dx)f(y)dy 


/m -1 / 0 aA_7_ m(dx)f(y)dy / 0 yA ‘ T y N {dx)f{i))dy 

where /(y) = f * v j yisi{dx). Consider m < j — 1. For y S y] we have y = y A 2±i = y A 

hence the error term is zero. Consider m > j. For y £ [2^, t! we l iave 2/ A 
Consequently, using (051) . we obtain that for j > J 1 / 2 ~ A -1 / 6 

»T ryAJ 


. fXi / y ^VT y N (dx)f(y)dy f^_ 1 y N {dx)f(y)dy 

fZ -1 fn yA ~ j2 N (dx)f(y)dy fZ-i f 0 ~ y N (dx)f{y)dy 


2 J- 


j-i 

.7 


i -1 


< A 1 / 6 . 


Finally, symmetric bound 1-< A 1 / 6 can be obtained by similar calculations. 


□ 


In previous proposition we have established uniform bounds on the eigenfunction w\ . Next, 
we show that w\ is a good approximation of u\. 

Definition 32. Let L be the matrix representation of the form l with respect to the algebraic 
basis ((see Definition EB1) . i.e. 

L id := , ip®) 

On the event 7 Z a , for A sufficiently small, the matrix L is symmetric tridiagonal. We want to 
bound the error between the solutions of the generalized eigenproblems: 

M(wi) = \~ 1 F(w i ) and M(ui) =y 1 L{u i ). 


Lemma 33. On the event 7 Z a holds 


l-P - L\\[2 < A 1 / 2 " 


3a 


(36) 


Furthermore matrix L is invertible and 


\\L\\ l2 ,\\F\\ l2 ,\\L-% 2 \\\F 


-in —l 


A 1 / 3 . 
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Proof. Consider vector (vj)j £ with || ( v j)j||/ 2 = 1 and the corresponding function v = 
£ V?. Since 

J 

\\(F-L)v\\% = — T(v,ipj)\ 2 

3=1 

J 

= VI (vj—iLj—i,j + Vj(Fj,j ~ Ljj) + Vj+\Lj + \ t j) , 

3=1 

to obtain (|36p , wc just havo to argue that Lj i,ji \^j,j — I Lj + ij are of order A 1 / 2 3a . By 

the definition of the forms l and fj from the Eigenproblem [3] 

JV-l 

“A^j— 1,J I = 'y 1 (V’j —1 (-^(n+l)A) — V’j — l{X n A)){lpj (-^(n+l)A) — ^j{X n a)) 

n —0 
TV—1 

= < ^)l(X (n+ l)A > - ^„A)(X (n+1)A - 

n —0 

TV—1 

+ J2 > ^)1(*(„+1)A < - ^ (n+1)A )(X„ A - 

n=0 

N-l 

< J2 < ^)1(X ( „ +1)A > ^i)(X (n+1)A - X nA ) 2 

n=0 

AT—1 

+ ^ IpCiA > ■ L 7 i ) 1 (" Y (™+l) A < i j i )(^(„+l)A — A'„a) 2 - 

n=0 

Moreover 

I Fjj - Ljj | < - £ |1(X„ A < ^) - l(X (n+1)A < ^i)|(X (n+1)A - X„ A ) 2 + 

ti=0 

i iV - 1 

+ ^ ^ V-ViA < i) ~ l(^(n+l) A < 4)K^(«+1)A ~ ^-A) 2 

n=0 

Hence, it suffices to show that for any x G (0,1) 

TV—1 

^ l(X nA < ®)l(X (n+1)A > x)(X {n+1)A - X nA ) 2 < A 1 / 2-3 ". (37) 

n—0 

By Definition 12® on the event 7 Z&, we have 
TV-1 

^ ^ l(-^nA ^ ^)l(-^(n+l)A ^ ^)(-^(n+l)A -^tia) 

71=0 

i N ~i 

^ A_ 2 a jvE 1(| ^-^ Al/2 - a) ' 

71=0 

Arguing as in the proof of Lemma 1181 we finally obtain 

■ N — l r x+A 1 /’ 1 - a 

- V l(|X nA - si < A 1 / 2 - 0 ) < / p 1 (x)da; + W (A)||p 1 || 00 < A 1 / 2 - 0 . 

^ ±7^ T*-Ai/a— 
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Since F is a diagonal matrix with diagonal entries of order A 1 / 3 , we have ||F||, 2 , ||F 1 || i2 1 ~ 
A 1 / 3 . As argued above, on lZ ai the upper and lower diagonal entries of L are of order A 1 / 2-3 ". 
Since for any 1 < j < J holds \L 31 — Fjj | < A 1 / 2-3 ", matrix L is diagonally dominant with 
diagonal entries of order A 1//3 . Hence it is invertible and ||F||, 2 , HA -1 !!;! 1 ~ A 1 / 3 . □ 

Lemma 34. Eigenvectors (wij), (uij), normalized so that ||wi ||;2 = ||ui ||;2 = J 1 / 2 , satisfy on 
lZ a 

ll(^id) — (Wbj)ll * 2 ~ A _1 / 3 ||(F — L)wi\\p. 

Proof. Recall that (Aare the eigenpairs of the Eigenproblem [55J with ||(ttfj)||j 2 = \fj■ 
Theorem 26] implies that there exists an eigenpair (Xj 0 , J~ 1 ^ 2 Wj 0 ) such that 

iV-Vl < J" 1/2 IIF" 1 1| ; 21|(F-L)^|| ( 2 < ||F- 1 ||,2||J i -L|| l 2, 

< «5" 1 (A- 1 )||i ? - 1 ||f/ 2 ||^|| z 1 / 2 ||(i ? -i)^ 1 ||, 2 . 


where 5(A Jo 1 ) is the so called localizing distance, i.e. ^(A^ 1 ) = min ;7 , j0 |A A — 1 |. From Lemma 

[55] we deduce 

lA-'-ir'i^A 1 / 6 - 30 . 

By Nakatsukasa j2l[ Theorem 8.3] for any i = 1,..., J we have 



<||L- 1 ||,2||Ar 1 (^-i)lb2 I 


which together with Lemmas [2H and [55] imply 

lA]" 1 -!! -1 ! < A 1/6 - 3a . (38) 

By Lemma 1551 holds IA ]” 1 — A^ 1 ) > 1, hence we must have jo = 1- Furthermore, from the same 
uniform lower bound on the spectral gap follows 

^a- 1 ) = ^Ar 1 ) > i. 

Since by Lemma [551 we have ||F _ 1 || 3 2 // 2 ||F || ; 1 2 //2 < A -1 / 3 , we conclude that the claim holds. □ 
Proof of Provosition\20\ Set e > 0. By Remark [551 there exists a s.t. P a ,b{^-v \ TZ a ) < e. Set 

n 2 = n a n {||«)i - fh\\l < a 1 / 7 - 60 }. 


Step 1. We will show 

E ff , b [l Ka • || (F - L)w i||f a ] 1/2 < A 5 / 12 - 3 “. (39) 

In the proof of Lemma 1551 we argued that for any j = 1,..., J holds 

i>j)\ < A 1/2_3a . (40) 

Hence, using uniform bound (l30l) . we deduce 

\\w u ^)-f{w u ^)\<^ 2 -^. (41) 

We will use the regularity of the eigenfunction w\ to strengthen (TUI) . Consider J 1 / 2 < j < J—J 1 ? 2 . 

By ([501) 


\l{wi,4>j) - 


w h j 
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Inequalities CUl) and (ED) imply 


+ \Ki,il>j) - /(CV’j)| +Kipj+i,ipj) 


wij+i _ x 
WlJ 


- 0 + - 1 )) S 


while, since 7?. a C(Ri , from Lemma 1551 follows 


E, 


cr,6 




2^*2 


< A3. 


We conclude that for J 1 / 2 < j < J — J 1 / 2 


E„ 


1 R „ ' K(™l>Vb') - /( wi ,^)! 2 


< A5“ 4a . 


Since a < yg inequalities SD and (14211 imply 

J 

E a , b [l Ua ■ ||(P — L)wi\\'i 2 ] = ' l/fai^i) -f(fih,Vh)| 2 ] 

i=i 

< j 1 / 2 A 1_6a + JA 4 / 3_8 “ < A 5 / 6-6a . 

S^ep 2. 1 Z -2 is a high probability event. Indeed, inequality ( 1391 ) and Lemma [Ml imply 

• ll^i - ^i||? 2 ] 1/2 < A 1 / 12 - 30 *. 


Hence, by Markov’s inequality, 

V a , b {C v \ TZ 2 ) < 2e + A- 1 / 7 + 6 “E CT , b [l KQ • Hu?! - u^} <2e+ C a A 1 / 0 - 1 / 7 < 3e 

for A sufficiently small. 

Step 3. On the event 1Z 2 holds 


(42) 


max 1 ^ - Si.il 2 < V \wi,i - S M | 2 = ||«;i - «i||? a < A 1 / 7 6 “. 

2=1,...,J ^— J 

2 = 1 

Since a < 1/42 the eigenvector (uij) inherits the uniform bounds of the eigenvector (wij). In 
particular, for any j = \aJ J — 1,..., \bJ~\ + 1, we have 

Ulj ~ 1 . 

Moreover, since for any j = 1,..., J holds wij > 0, we deduce that 

J 

< 0) < \\wi-ui\\% < 1. 

j =i 

Finally, note that on the event 1Z 2 the eigenvalue 71 ~ 1 since on lZ a , by (1551) . holds IA / 1 — 7 -j -1 ! 
A 1/6_3q < 1 and A / 1 ~ 1 by Lemma [29] 


3.5 Proof of Theorem [8] 

As announced in Section f 2.21 we will bound the approximation error of the spectral estimator and 
the time symmetric Florens-Zmirou estimator by the difference of forms / and l. 
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Lemma 35. On the high probability event 1Z 2 from Provosition \2(A holds 


\bp 

\\vs-°Fz\\Li([a,b])£ E IK“i)^-)-/(“i>^j)l- 

j=[aj\ 


Proof. From representations (BUI) and (fTUl) follows that 


T FZ\\L 1 {[a,b]) 


\bJ 1 

E 

j=\aJ\ 


T S,j - a FZ,j 


< 


T IbJ] 

J S 

j—{aJ\ 


\l{ui,lfj) - f(ui,lfj)\ 
uij f]L i fiN{dx) 


By Proposition [UU) for j = [aJ J — 1 < j < \bJ] + 1, we have u\ t j ~ 1. Since, by Lemma ITfsl 

3_ 

J //_! )un((Hx) ~ 1, we conclude that the claim holds. □ 

—j— 

Proposition 36. For every function v £ Vj and any j = 1,..., J we have 


E<t,6[1 R! • | - l{v, 1 pj) | 


21 2 < 


K-rl' + Kl' + K+rl 2 )^, 


where v corresponds to the vector (t >j)j=i,...,j and vq,vj+i = 0. 


Proof. First, note that since for i ^ j holds f(ipi,ipj) = 0 we have f(v,ipj) = Vj f(i/jj,ipj). More¬ 
over, on the event 72i, for A sufficiently small, the increments of the process X are smaller than 
J -1 . Hence, for |i — j | > 1, holds ) = 0. Linearity implies 


l(v, ifj) = + v j+ il(ip j+ i,i/}j). (43) 

Consequently, it is sufficient to show that 

E a 4|T(^_i,V>;)| 2 ] 1 [I ?{Mi) ~ KMj) I 2 ] E 

+ E a 4m,iP J - 1 )\^ <Ai. (44) 

Decomposing the terms above like in Lemma[331 we obtain that (1331) follows from Theorem[3I] □ 

We are now able to prove the suboptimal rate A 1 / 6 for the root mean squared L 2 ([a, &]) error 
of the spectral estimator as- 


Proposition 37. For every e > 0 and A sufficiently small, there exists an eventlZs = 72 . 3 (e) C TZ 2 , 
with P CTi b(£„ \ 72 . 3 ) < e, such that for every x £ (a, b) 


E<r,fc[ 1 k 3 • \°s( x ) - <? 2 ( x )\ 2 ] 2 ^ A '- 


Furthermore, on IZ 3 , for every [aJ J < j < \bJ~\ we have 

a S,j ~ !. 


E<t.6 

1-R.3 ' 


2 1 



U hj 

"I 


(45) 

(46) 

(47) 


Remark 38. Given the uniform lower bound on the derivative uij, and since A 1//6 ~ J 1 / 2 , 
inequality (1371) can be reformulated as 


E„ 


1^3 


ii(4±4)- 


X(4) 


j- 1 / 2 


< 1. 


By means of Markov’s inequality the latter can be interpreted as almost 1/2—Holder regularity 
of u\. In that sense Proposition [37] is a discrete time equivalent of Proposition [531 which states 
that the derivatives of the eigenfunctions inherit the regularity of the design density, in the high- 
frequency case the regularity of the local time. 
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Proof of Proposition [23 Fix e > 0. Let IZ 2 be the high probability event introduced in Proposition 
EDI On 72-2, we choose the eigenfunction ui s.t. 


J 

u^ j = J and uij ~ 1 for every \aJ\ — 1 < j < \bJ] + 1 . (48) 

.7=1 

Step 1. Proof of \46]) . On the event 72i, for A sufficiently small, using the representation (1201) 
together with EH and 1)4811 we obtain that 


Ki’j .A) ^ ~2 ^ 1 + ipj + ipj + 1 , ifj ) 

-J -$ a S,j $- -1 - 

JZ1 /Pv(da:) / A /Jjv( dx) 

holds for every [aJ\ < j < \bJ]. Since 


(49) 


N—l 

—1 “I” Ipj “I” rS ^ ^ “1“ (^(n+l)A))(-^(n+1)A -^nA) 5 

n —0 

we deduce that ■ < cr|^ z .. Furthermore, since on IZ 2 holds 


1 N ~ x 

Z(AVb) > ^ E *(¥ + A5/11 ^ ^ - A 5 / 11 )(X (n+1)A - X nA ) 2 , (50) 

n =0 

the spectral estimator can be bounded from below by a time symmetric Florens-Zmirou estimator 
with bandwidth ^A 1 / 3 — A 5 / 11 ~ A 1 / 3 . Arguing as in Corollary [Ml we deduce that there exists 
a high probability event 7234 , such that on 72-3,1, bound a 2 s {x) > 1 holds for any x G (a, b). Set 


72 3 = 7^2 O 723,1. 

<S7ep 2. Proof of (J5\ ). Fix x G (a, 6 ) and chose j s.t. < x < Representations EH and 
m, together with Lemma 1181 imply 

\°s,j - °FZ,j\ 1$ A- 1/3 |T(ai,^-) - /(uiA)l- 

Hence, from Proposition [Ml and (E51) follows that 



By Theorem QIJ] and Holder regularity of a 2 

E CT ,b[l-Ri • \W 2 -vlzWlcV Z Al/6 - 

By the triangle inequality we conclude that (E5l) holds. 

Step 3. Proof of ffty - Set [aJ\ < j < \bJ~\ . We will only prove 



1 n 3 ■ 

Ai+i l 

2 1 



u i ,3 



(51) 


as the symmetric bound on the second moment of \n 3 


“1,3 


- 1 


can be obtained analogously. 


The general idea of the proof is similar to the proof of m in Proposition [3TJ First, we will show 
that m follows from 


E, 


iT.b 


l n 3 


uij+ia s>j+1 


3+ 1 

Jj PN{dx) 2l 

T 2 - 1 


/A m{dx) 


< A». 


(52) 
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To that purpose, by the triangle inequality and since on IZ 3 the derivatives Ui,j, Ui,j+i ~ 1, we 
have to argue that 

_ j+i 

°s, j+ 1 f? m{dx) 


E„ 


l-Rs 


- 1 


a s,j JA m{dx) 


2 i i 


< A 1/6 . 


(53) 


Step 3.1. Proof of [53) . By Lemma fl8l holds J fj _ 1 ]UN{dx), J fj 1 fiN{dx) ~ 1. We defined above 
the event AI3 so that 0/ d's ]+1 ~ 1. Hence, to prove (1551) . it suffices to show 


Ect ,6 


• k 

1 n 3 ■ 


s,j +1 ~ "Sjl 

j+1 


< A 1/6 


fl N (dx)- / A (A) I 


(l5Hl follows from fi5l) and 1/2 Holder regularity of a 2 . Indeed 


E<7,6 

+ E CTj {, 

< A 1/6 . 

To prove (155ll let 


1^3 ' ks,l+i — Tsy 

.2/3 + 1/2 


< E, 


<a,b 


< A 1 / 2 . 


1 |~2 3 J + 1/2 12 

•■'R-3 ' A,3+1 - CT (— J - ) 




+ E, 


<t ,6 


a 3 • k 2 (AP)-°i 


m(dx) - / jiN(dx) 


< 


3+1 

/ p\{x)dx — / i-ii(x)dx 
JL Ji^l 


fiN{dx) — / p,i(x)da 


p,i(x)dx — / fiisi(dx) 


■— Ei + £2 + A- 

By [Supplement A, Theorem 11] we have 

E ff>6 [E?+£g]* < A 2/3 , 

while the Cauchy-Schwarz inequality, together with [Supplement A, Theorem 8] yield 

r fJ _1 

EA-Ef] 2 = e < 7,6 / Mi(j + #) - Akr + x )dx 

1 Jo 


< 


1 

L J 


E ct ,6 [kl( j + x) - + x)\ 2 ]dx 


< A + 


(54) 

(55) 


.Step 3.1?. Proof of [53) . The representation CHI) . together with the eigenpair property of 
(71, A), imply that 


l ‘ij + i J Sj+i4 j faidx) = X(u u ili j+1 ) = g(ui,A+i) 

uijdg ■ [/_! fiisr(dx) K u ii' t l ) j) fl’kiiA) 

In what follows we want to apply methods from the Perron-Frobenius theory for nonnegative 
matrices. To that purpose recall the definition of matrix M from Section [TTT~T1 Eg. (1^51) . We have 

g{ui,lpj+ 1) _ Em=l M ">j+l«l, m 


?AA) Zi =1 MmjUl,r, 
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To bound the above ratio we would like to proceed as in the proof of inequality (Kill) in Proposition 
mi Unfortunately, we can’t, as we don’t know if the vector of derivatives (uij) is positive. Still, 
using the inequality (1521) and arguing as in the proof of (15T1) . we obtain that 


Em^l Mm,j+lUl >Q) 

EEm= 1 Mm,jU 1 ,777.1(^1,771 >0) 


1 < A 1/6 . 


To finish the proof we need to show that the possible error due to the negative derivative terms is 
small enough. On the event 7?. 2 we have 


j ^ 

Furthermore, on the event IZ3 we have l(ipj , 'iftj) > JJ_ 1 jj,N(dx); indeed we defined IZs such that 
the left hand side of (1551) has a uniform lower bound. Thus, by Lemma 1T51 


y: M mJ u i, m = g(u 1 ,^') > / At N(dx ) > A 1/3 . 

771=1 J 


Consequently, we need to show that 


j 

y, (M m j + M mJ+ i)|ui, m |l(ui, ro < 0) < AT 

771=1 


From (1251) follows Mij < J 2 . By the Cauchy-Schwarz inequality and Proposition 1201 


(Afmj- + M miJ - + i)|«i, ro |l(«i im < 0) < 

771=1 

J 1 

<j- 3 / 2 (yK m | 2 i(tti, m <0))'<Ai □ 

771=1 


To obtain the suboptimal rate A 1 / 6 we only used uniform bounds on the derivatives vector 
(uij)j together with the general error bound from Proposition[36l Having established the regular¬ 
ity of the eigenfunction ui , we are now able to argue that the error E^j, [l^ • 11( u\ ,ipj) — f ( u \, 'ipj)\\ 
is at most of order A 2 / 3 . 


Lemma 39. Denote I(x) 
j = 1,..., J, it holds 


= x — cq, with cq such that I £ Vj. For A sufficiently small, for every 


E, 


tT,b 


i TCl - 1 /( 1 , 


< A 2 / 3 . 


(56) 


Proof. We will reduce (l56l) to the term bounded in Theorem [42] By definition of the forms l , / 
and the representation (ITU it holds 


— r, £(*(n + l)A l/jj(X n a)), 

71=0 

^ y ] ( 1 j ( A "„ a ) + lj (- Y ( n + 1 ) A ))( A( n+1 ) A — A „ a )“- 

71=0 

We will analyze the error contribution of a single summand. When X nA , Aqn+i)A £ [j^i both 
forms contribute by b(X( n+1 ) A — A' nA ) 2 , hence cancel perfectly. When both A„ a , A"( n+1 ) A ^ 
-,j\ neither of the forms contribute. Since on TZi, for A sufficiently small, the increment 
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l-^(n+i)A ~ X nA \ < 1/J we deduce that the overall error | f(I,ipj) — 1(1,xpj )| is due only to 
summands with the increment X nA ,X^ n+ i^ A crossing the boundary of 7 ]. In such case the 
form / contributes by ±(X (n+1 ) A - X„ A ) 2 , while l by i(X (n+1)A - X nA )j3, where 

P = sgn(X (n+1)A - X nA ) ■ length {[X nA , X (n+1)A ] n j]). 

Let 7 = A'( n+1 ) A — X nA — ft. The contribution of a single boundary crossing summand equals 

j(-^(n+l)A — X nA ) 2 — -(A( n+1 ) A — X nA )P = -(/J + 7)(7 — P) = ————-■ 

Considering all four possible crossing configurations, we obtain that 

JV-l 

~ V’j) = (l( J.,i](^(n+1)A) ~ ^(4,1] (^a))- 

n =0 

■ ((a ( „ +1)a - j) 2 - (x nA - i) 2 ) 

N—l 

+ ^2 (l(i^i,l](-^(n+l)A) — l(2yi j i](A nA ))- 

n—0 

■{(X [n+ 1 )A -i^f~(X nA -^) 2 ). 

Thus, (1561) indeed follows from Theorem IT?! □ 

Proof of Theorem\8 1 Set e > 0. Let 72-3 be the high probability event introduced in Proposition 
[371 In view of Remark [331 it is enough to prove the claim for the estimator <r|. By Lemma [531 and 
since J ~ A -1 / 3 , it is sufficient to show that for any [aJ\ < j < \bJ~\ holds 

’ IKuui/jj) - fiu^tfj)]] < A 2/3 . 

By Definition ITT! holds f(ui,ifj) = ui t jf(ij)j,ipj) = u\ t jf(I, tfj). Since on the event IZ 3 , for A 
sufficiently small, the increments |X(„ + i) A — X nA \ < J^ 1 , we have 

l(ui, 1 pj) = Uij-il(lfj,lfj-i) +Ui t jl(lpj, 1 pj) +Uxj + 1 l(lpj, 1 p j+1 ), 

K 1 , $3) = + KV’j. i’j) + 

Consequently, since by Proposition [33] u\,j ~ 1, we deduce that 

Kui ,ipj) - ^l(ifj, if j- i )( u ^~ 1 - i) + ~ T( I ^j)+ 

+ T(^^ j+ i)(^-l)- (57) 

By the Cauchy-Schwarz inequality together with Proposition [33] and the inequality (1471) we can 
uniformly bound the mean absolute value of the first and third term by A 2 / 3 . Since IZ 3 C 72 .1 the 
mean absolute value of the second term is bounded in Lemma [531 □ 

3.6 Technical results 

We devote this chapter to the proof of two technical results that provide us with control over, 
properly rescaled, mean number of crossing of a given level a. 

Definition 40. For a £ (0,1) and n = 0,..., TV — 1 define 

x( n , a) 1 [o,a)(A”( n _|_i) A ) t[o,a)(A nA ). 
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The random variable \ codifies the event of the increment X nA , A(„ +1 )a crossing the level a. The 
sign of x contains information about the direction of the crossing. Since 


I x(n, a)| < 1 (|A„ a - a\ < w(A)), 


arguing as in the proof of Lemma (1181) we can show that 


E lx(«.a)l < 4w(A)/xi. 

n—0 

Consequently, Theorem 1171 implies that the mean number of crossings, rescaled by the sample size, 
can be upper bounded by A 1 / 2 log(A). Keeping in mind that (A*(„ +1 )a — A„a) 2 is of the order 
A = 1 / N , the next result is a refinement of the bound above. 

Theorem 41. For every a £ (0,1) we have 


N -1 


E, 


a,b 


(E \x(n,a)\(X {n+1)A -X nA ) 2 ) ]’ <A 1/2 


n —0 


Proof. Fix a £ (0,1). Since|x(n, a)| = 1 if and only if the increment (X nA , X( n+1 ) A ) crosses the 
level a, the claim is equivalent to the inequalities: 


E<t,6 

Ect,6 


iv — x 2 — 

( l(X nA < a)l(A'( n+1 jA > Qi)(-X"(n+ 1 )A — XnA) 2 ) 

n =0 

N-l 2 i 

( E ^(XnA > a)l(A' (ri+1 )A < aO(A(„ +1 )A - AT^a) 2 ) 


< A 1 / 2 , 


< A 1 / 2 . 


n =0 

Below, we only prove the first inequality. The second one can be obtained in a similar way or 
by a time reversal argument. Denote 

T]n = l(-^nA ^ ^)l(-^(n+ 1)A ^)(-^(n+l)A -^tia) • 


We have 

N—l 


N—l 


N—l 


E ( 


r, ^[( ^ v l(-^nA ^ ^)l(-^(n+l)A ^ ^)(-^(n+l)A -^-nA) ^ ^ ^ ^cr,b [Vn\ "F2 ^ ^ ^a,b[TjnTJm\’ 


n —0 


n =0 


0 <n<m 


Denote by p t the transition kernel of the diffusion X. Uniform bounds on diffusion coefficients 
imply that 

Pt {x,y)< Mi 4= exp ( - ^ V ^ 


Mot 


-)• 


(58) 


with Mi, M 2 positive constants uniform on 0, see [261 , Lemma 2]. From (f58|) and the inequality 
P, Formula 7.1.13]: 


dz < 


follows that 

r a rl 


f 

Jo 


0 J ol 


PA{x,y)(y - x) 4 dydx < 

< A 2 


0 J cx. 


e x < ^ e -« 
; + ^Jx 1 + 4/7T 2 


1 _ (y X , \d-7 7 ^-a9 

__ e ca (y — x) dydx < 

l — x 

VA c z 2 


(59) 


1 —a: 

e -2 z 4 dzdx 

V Ac 


/■“ /’vs; 2 2 „ /■“ («-*) 2 ... 

/ / e 2 dzdo; < A 2 / e 2cA da; < A 5 / . (60) 

Jo J Jo 

x/Ac 
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Similarly 


pa pi 

/ / PA{x,y)(y - xfdydx < A 3/2 . 

Jo J a 


(61) 


For simplicity we will use the stationarity of X , which is granted by Assumption [2] Using more 
elaborated arguments the result could be obtained for an arbitrary initial condition. By stationar¬ 
ity, for any f, the one dimensional margin X t is distributed with respect to the invariant measure 
p(x)dx. Conditioning on X nA , from (1601) and uniform bounds on the density p follows 

pa pi 

E CT ,b[r/ 2 ] = / / PA(x,y)(y - x) 4 dyp(x)dx < A 5/2 . 

J 0 J a 


Hence 


TV-1 


E E °Av 2 n\ <NA* = A>. 


n —0 

The Cauchy-Schwarz inequality implies 

TV—2 TV—2 


E E CT)6 [7?„r?„ + i] < ^ < N A^ < A>. 


n=0 n—0 

Finally, using (IfTlT) . for m > n + 1, we obtain 
/»1 />1 


/'Q; p i pcx pi 

E aAVnVml = / / / PA(x,y)(y-x) 2 p {m _ n _ 1 )A (z,x)(z-w) 2 p A (w,z)p(w)dydxdzdw 

Jo J a J 0 J a 

r a r l 


< 


rex pi pa pi 

/ / PA{x,y){y - x) 2 dydx == / / {z - w) 2 p A (w, z)dzdw 

Jo Ja [m — n — 1)/a J 0 Ja 


' 0 J a 

< A 5 / 2 


yjm — n — 1 


Consequently 


TV-3 TV-1 


TV-3 TV—n—2 


TV-3 


n—0 m=n+2 


E E E ^[M^A 5/ T E 7? ^ A ' 5/2 E VlV-n^2 

n—0 fc=l n—0 

TV-2 

= A 5 / 2 A 5 / 2 A 3 / 2 = A. 


□ 


Note that the claim of Theoremim still holds when we replace (X( n+1 )A — AAa) 2 by (A'(„ +1 )a — 
a) 2 or (X nA — a) 2 . Next, we show that, when considering the direction of the crossings, cancella¬ 


tions occur that make the difference of Y^n=o x( n i a)(X( n+ i)A — c*) 2 and Y^,n=o x( n , «)(^nA-a) 
even smaller. 

Theorem 42. For any a £ [j, 1 — -j] we have 

N -1 „ i 


JV-1 


E 


'a ,b 


1 ^i ' E x(n,a)((X {n+1)A - a) 2 - {X nA - a) 2 ) 2 < A 2/3 . 


n—0 


Due to the sign of the terms the proof of the next theorem cannot be done in a similar way as 
the previous result. In what follows we show that on the event TZ\ 


N-l 1 N-l 

E x(n,a)((X {n+ 1 )A - a) 2 - (X nA - a) 2 ) = / 1(X S < a)ds - — E W^A < a) + 

n=0 , ' 0 n -0 


R, 
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where the remainder term is of the right order. Thus we are left with showing that 


E ( 


(7,6 


N—l 


1 (X S < a)ds - - ^ (!(*"* < a) 2 < A 2/3 - 


n —0 


2 i i 


(62) 


Note that jt y~)„_ n (1 (X„a < a) is a Riemann type estimator of the occupation time of the interval 
[0,a). The problem of establishing the rate of convergence was recently considered in 22, fl9| . 
Although obtained results do not apply as they require higher smoothness of the coefficients, 
they suggest an ever better rate A 3 / 4 . Indeed, in the case of reflected diffusion with bounded 
coefficients, we can show that 


E 


£7,6 


f(X s )dsf(X nA ) 


N—l 


N 


71=0 


2-i 


< 


Iff*) 


for any cadlag function / with Sobolev regularity 0 < s < 1, see [4|. 

Proof. Fix a £ y, 1 — y]. On the event TZi, whenever l[o, a ) (A(„_(_i)a) — l[o,a) (A„a) / Owe must 
have |X„a — a|, |-X( ra +i)A — a\ < w( A) < A 4 / 9 . Consider function d : [0,1] —>■ R given by 

d(x) = (x — a) 2 l(|x — a\ < A 4 / 9 ). 


We have 


(l[0,a) (A( n _}_i)A) l[0,a) (A/ a)) ((A/ n _|_i)A (V n A Cv) ) 

= (l[0,a) (A( n _|_r)A) I [0,a) (A^a)) (d(A^ n _j_i^A) d(X nA ')') . 


Step 1. We will first show that 




Eo-,6 


N-1 

| ^ ^ 1[0,q:) (-^tia) (d(-AT( n +l) a) d(A n A)) 

71 = 0 

! n i 

< A 2 / 3 . 

(63) 

Note that 

d'(x 

) = 

2(x — a)l(\x — a\ < A 4 / 9 ), 





) = 

-A 4/9 (5{ 0 ,_ A 4/9} + l(|x — a| < A 4 / 9 ) - A 4 / 9 <5{ a+A 4/9}, 



where the second derivative must be understood in the distributional sense. Since we fixed a 
separated from the boundaries, d'( 0) = cf(l) = 0 for A small enough. Denote by 

L S A X ) : = L t(x) - L s (x), 

the local time of the path fragment (X u ,s <u<t). From the Ito-Tanaka formula [25], Chapter 
VI, Theorem 1.5] follows that 


d(X {n+ i )A ) - d(X nA ) = / d\X s )a(X s )dW t + / d’(X s )b(X s )ds+ 

J 71A J 71A 

/*(ti+1)A 

+ / a 2 (X s )l(|X s - a| < A 4 / 9 )ds - A 4 / 9 L nA ,(n+i)A(a - A 4/9 ) 

JnA 

/»(ti+1) A 

- A 4 / 9 L„A,(n+i)A(a + A 4 / 9 ) := / d'(X s )a(X s )dW t + D n . 

J 71A 


First, we will bound the sum of the martingale terms. Since martingale increments are uncorre¬ 
lated, using Ito isometry, we obtain that 


E 


'£7,6 


i-i 1 /*(n+1)A 

V l[o,a)(A„ A ) / d\X s )a(X s )dW t 

U n—0 
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N ~1 r(n+ 1)A 

1 [0,a)(^nA) / ( d'(X s )a(X s )) 2 ds 

n—0 JnA 


< A*E a , b 


1(|A S — a\ < A B )ds 


Jo 


= A5 


a+A9 


-A 9 


Ecr,i,[^i(a :)]dx < As, 


where the last inequality follows from (1131) . Now, we will bound the sum of the finite variation 
terms: Y2n=o l[o,a)(^nA)^n' Note first, that since b is uniformly bounded, we have 


N-l n(n+ 1)A rl 

E l[o,a)(A„ A )| / d'{X s )b(X s )ds | < A 4/9 / l(|x-a| < A 4/9 )m(x)dx < A 8/9 ||^i|| 00 . 
n=0 JnA Jo 

Since by the inequality (1131) ||/ii||oo has all moments finite, the root mean squared value of this 
sum is of smaller order than A 2 / 3 . Now, note that since on the event TZi w(A) < A 4 / 9 , condition 
A„a < a. implies i nA y n+1 ) A (a +A 4 / 9 ) = 0. On the other hand, whenever L„ A .( n+1 )/\(a —A 4 / 9 ) ^ 
0 we must have A„a < a . Hence 


N-l 


y l[0,a)(A„ A )(A 4 / 9 L„A, ( „ + 1 ) A(a - A 4/9 ) + A 4 / 9 L„A,(n+i)A(a + A 4 / 9 )) = A^L^a - A 4 / 9 ). 


n—0 


Using first the Cauchy-Schwarz inequality and then the regularity of the local time (see |2f 
Chapter VI, Corollary 1.8 and the remark before]) we obtain 


E, 


<a,b 


A 4 / 9 L 1 (a - A 4/9 ) - / Li(x)dx | 2 ] < A 4/9 / E a , b [\ L i(x) - L^a - A 4/9 )| 2 }dx 


l-A 4 /9 


i-A 4 /9 


< A 4 / 9 r I a - (a - A 4 / 9 )| dx < A 4 / 3 . 

Ja- A 4 /9 


Consequently, to prove (1631) we just have to argue that the root mean squared error of 

ncn N—l r{n-\- 1)A 

/ L x {x)dx - y l [0 ,„)(X nA ) / a 2 (A s )l(|A s -a\< A 4 / 9 )ds 

J a—A 4 / 9 n= o **nA 


N—l «( n +i)A 


= E 

n=0 


(1(X S < a) - l(X nA < a))a 2 (X s )l(\X s - a| < A 4 / 9 )ds 


nA 


N — l p{n-\- 1)A 


= E 

n=0 


nA 


(1(X S < a) - l(X nA < a))a 2 (X s )ds 


(64) 


is of order A 2 / 3 . From the Lipschitz property of tx 2 follows that 


N-l r (n+ 1)A 


E 

n—0 


(1(X S < a) - l{X nA < a))(cr 2 (A s ) - cr 2 (a))ds 


nA 


< 


,a+A 4 / 9 


N—1 /*(n+l)A /»a-|-zA ' 

<y 1(\X S - a\ < A%)A%ds < A% pi(da:) < As ||^ 1 || 00 . 


n—0 


I nA 


Thus, by (fT3l) . we reduced (1M1) to 


Af-l 


1(A' S < a)ds - — E (l(-X"nA < a), 


n—0 


which is of the right order by (El. We conclude that (El holds. 
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Step 2. Consider the time reversed process Y t = Ai_ t . Since X is reversible, the process Y, 
under the measure P a ,b, has the same law as A'. Furthermore, the occupation density and the 
modulus of continuity of processes Y and X are identical, hence 1Z± is a “good” event also for Y. 
Inequality (1631) is equivalent to 


E 


a,b 



N—l 

^ ^ l[0,a)(^mA)(^(^(m+l)A) a)) 

m —0 



< A3. 


Substituting n = N — m we obtain 


N-l N -1 

^ ^ l[0,a)(^mA)(^(^(m+l)A) a)) = ^ ^ l[0,a) (-^(n+l)A) (^(-^(n+1 )a) ^(-^nA))* 

ra=0 n=0 


□ 


4 Low-frequency analysis 

4.1 Spectral estimation method 

In 1998 Hansen et al. [l2j explained how the coefficients of a diffusion process are related to the 
spectral properties of its infinitesimal generator. In this section we want to shortly introduce the 
main idea of their method. 

The generator L of the reflected diffusion A' is an unbounded operator on L 2 with 

dom(L) = {/ G H 2 : /'(0) = /'(l) = 0} 

Lf{x) = S-~ 1 {x)(^a 2 (x)p,{x)f\x))', for f G dom(T). 

Spectral properties of L are discussed in the appendix Section [Bj Seen as an operator on the 
equivalent Hilbert space L 2 (/i), the generator L is elliptic, self-adjoint and has a compact resolvent 
operator. Consequently, the eigenproblem 

Eigenproblem 43. Find (£ ,u) e R x L 2 , with u ^ 0, such that 

Lu = (u. 

has countably many non-positive eigenvalues 0 = (jo > Ci > (2 > ..., with p—orthogonal 
eigenfunctions (tq)i=o,...- The eigenvalue (1 is simple and the corresponding eigenfunction u\ is 
strictly monotone, see Proposition 1531 The main idea of the spectral estimation method is that 
the diffusion coefficient er 2 can be expressed in terms of the invariant density /i and the eigenpair 
(Ci, til) (c.f. Hansen et al. (T^, Eq. 5.2]): 

„’(*) = {65) 

u^nix) 


4.2 Estimation error of the invariant measure 


From now on we take the Assumptions [5] and [T] as granted. Fix A > 0 and 0 < a < b < 1. Set 
J ~ TV 1 / 5 . Since the generator L has a spectral gap, diffusion X is geometrically ergodic. Below 
we state general bounds on the variance of integrals with respect to the empirical measure /T/v, 
which are due to the mixing property of the observed sample (A' n A)n=o,...,JV- For the proof we 
refer to Chorowski and Trabs [5|, Lemma 9]. 

Lemma 44. For any v,u £ L 2 ([0,1]) we have 


Vfu>.h 

N-l 


/ v(x)f2 N (dx) 
To 


Var ^ in 


^ ^ v(X nA )u(X {n+1)A ) 


n —0 


<^->11^, 

<N-'\\vP A u\\ 2 L2 
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Corollary 45. There exists a high probability event 71, with P CT ^(fl\7i) < N 1 J 2 , such that, for 
any 1 < j < J, on 71 holds 



J2 N {dx) 


1. 


Proof. Since the invariant density /r is uniformly bounded on 0, there exist constants 0 < c < C 

3_ 

s.t. c < J f /_i n(x)dx < C. Let 


Ti = {Vj 


= 1, 


, J holds 


/ ip'Jx)jj,N(dx) — / ip'Jx)n(x)dx 

Jo Jo 


< — ). 

“ 27 J 


Using first the Markov inequality and then ||i/i' || 2 2 = J 1 with ||^||| 2 = J 1 we conclude that 
the claim holds. □ 


4.3 Proof of Theorem [9] 


First, we state the approximation properties of the spaces Vj. 

Definition 46. Denote by nj and nf the L 2 and L 2 (p,)— orthogonal projections on Vj respectively. 

Since Vj is the space of linear spline functions with regular knots at {0, j, -j,..., 4=1, 1}, it 
satisfies the following Jackson and Bernstein type inequalities: 

II (/ - < J- (2 - fc) “||/|| cl „, for / e 1]) and k = 0,1, (66) 

Ikllffi < j IMIl 2 for V e Vj. (67) 

Definition 47. Denote by {<t>j)j=o,...,J the Franklin system on [0,1], i.e. the L 2 —orthogonal 
basis of Vj, obtained from the Schauder algebraic basis by the Gram-Schmidt orthonormalization 
procedure. 

For construction and properties of the Franklin system we refer to Ciesielski [6]. In particular, 
basis functions (4>j)j satisfy the following uniform bound (cf. Ciesielski [gj, Theorem 5]): 


E‘ 

3=0 


< J. 


( 68 ) 


Proof of Theorem 03 As noted in Section eh the estimator ((Utti) is constructed in the exactly 
same way as the eigenpair estimator in Gobet et al. (Tl| , Cliorowski and Trabs JB;]. Given the 
properties of the Franklin system, arguing as in Chorowski and Trabs [Bl, Corollary 18], we obtain 
that there exists a high probability event 72, with P CTj &(D \ T 2 ) < IV -2 / 5 , such that 


E CT , b [l T2 • (| 7l - Ci | 2 + ||«i - will^i)] 5 < iV" 1/5 . 


(69) 


Furthermore, on the event T 2 , we have |ui| ~ 1 and ||Mi||i/i < 1. 

Before we can prove the upper bound on the estimation error, we need to face one more technical 
difficulty. Since the estimator u\ converges to the eigenfunction u\ in the sense of mean H 1 norm, 
we can’t postulate a uniform positive lower bound on inf xe r a w u'^(x). Following Chorowski and 
Trabs [Bj, Lemma 19], this difficulty can be overcome by applying the threshold cr| A D. We 
conclude that there exists a high probability event T-j C T 2 D 71, with P CT!b (D \ 71) < N~ 2 ^ 5 , such 
that on 71, for j s.t. [^-, j] C ( a,b ), we have 


°s,j AD = 


-2Ci /p 1 i>j{x)ui{x)fi N {dx) 
(uij V c a ,b) fg i/>j(x)jx N (dx) 


A D, 


(70) 


for a deterministic constant c a ,b > 0 satisfying c a ,b < inf^gja,;,] w) (x). 

Having established (IMIl and (I7H1) . the plug-in error can be bounded by similar considerations 
as in the proof of Chorowski and Trabs [5|, Theorem 7]. □ 
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A Construction and properties of a scalar diffusion with two 
reflecting barriers 


In this section, we construct a weak solution of the SDE ([!]). In the next sections we will use 
the presented construction to generalize properties of scalar diffusions to reflected processes. The 
main idea of the following reasoning is to extend the diffusion coefficients b and er to the whole 
real line, apply general SDEs theory to obtain a solution on R and finally project this solution to 
the interval [0,1] in a way that corresponds to the instantaneous reflection. We refer the reader 
to flfll . 1.23] for a very similar construction of a diffusion on[—1,1] with two reflecting barriers. 

Definition 48. Define / : R —» [0,1] by 


/ (x) = 


x — 2n 


: 2n < x < 2n + 1 


2 (n + 1) — x : 2n + 1 < x < 2n + 2 
Function / is almost everywhere differentiable with the derivative 


, for n £ N. 


f'(x) 


1 : 2n < x < 2n + 1 

— 1 :2n + l<a;<2n + 2 


For cr, b : [0,1] —> R we define the extended coefficients <r, b : R R by 

b(x) = f(x)-bof(x) 
a(x) = aof(x). 

Theorem 49. Grant Assumption Q] For every initial condition xo £ [0,1] that is independent of 
the driving Brownian motion W the SDE 

dY t = b(Y t )dt + a(Y t )dW t , (71) 

Yo = Xo, 


has a non-exploding unique strong solution. Define 


X t = /(b)- 


The process ( X t ,t > 0) is a weak solution of the SDE 171) . 

Proof, b is bounded and a' £ Ig ( R). Hence, the existence of a unique strong solution (Y t ,t > 0) 
of the SDE (1711) follows from |28l . Theorem 4]. As discussed in the proof of (l8l . Chapter 5, 
Proposition 5.17] the boundedness of b prevents the explosion of the solution. Process Y is a 
continuous semimartingale, hence by ( 25 L Chapter VI Theorem 1.2] it admits a local time process 
{Lj,t > 0). By the Ito-Tanaka formula ( fldi. Chapter VI Theorem 1.5]) process X satisfies 


X t = 


xo+ [ b(Y s )f'(Y s )ds + f 5(b)/'(b)dW 5 + ^Lf(2n)-^Lf(2n + l) 

, ' 0 •'° nez nez 

Xq + f b(X s )ds + f cr(X s )dB s + Kf. 

Jo Jo 


where B t = f* f(Y s )dW s and K t = J^ neZ Lj (2n) — + !)• Note that for any T > 0 

the path (X t ,0 < t < T) is bounded, hence K is well defined. Process B is a martingale with 
quadratic variation 

(B) t = [ ( f(Y s )) 2 ds = t. 

Jo 

Hence, Levy’s characterization theorem implies that B is a standard Brownian motion. From 
the properties of the local time L\ follows that K is an adapted continuous process with finite 
variation, starting from zero and varying on the set \J nGZ {Y t = 2n}U{Yt = 2n+l} C {X t £ {0,1}}. 
Consequently, X satisfies the SDE JT]). □ 


35 


Next, we use the above construction of a reflected diffusion process to prove Brownian bounds 
on the moments of the modulus of continuity of X. 

Proof of Theorem 1 1 71 Fischer and Nappo [Z] proved claimed upper bound for the standard Brow¬ 
nian motion. We will now generalize their result to diffusions with boundary reflection. 

Step 1. Consider a martingale M satisfying dM t = cr(X t )dWt. By the Dambis, Dubins-Schwarz 
theorem, M t = Bp a 2( Xu )du f° r some Brownian motion B. Consequently, 

\M t — M s \= \Bp a 2 ( X p du — Bj* <T 2 (x u )du| < u B {\t — s|||cr 2 ||oo), 

where oo B is the modulus of continuity of B. Thus, m holds for the martingale M, with a 
constant that depends only on the uniform upper bound on the volatility cr. 

Step 2. Consider a semimartingale Y satisfying dY t = b(X t )dt + dM t . Then 

\Y t -Y s \ < | [ b(Y u )du — 

Jo 

Consequently, m holds for the semimartingale Y, with a constant that depends only on the 
upper bounds on cr and b. 

Step 3. For (cr, b) £ 0 consider the reflected diffusion process X satisfying the SDE . Let 


/' 


b(Y u )du\ + \M t -M s \ < |t-s|||6||oo + cu M (|t-s|). 


dY t = b(Y t )dt + a(Y t )dW t , 
X t = f(Y t ), 


where b, a and / are as in Definition 05] From Step 2, it follows that m holds for the semi¬ 
martingale Y with a uniform constant on 0. By the construction of the reflected process X we 
have |X S — X t \ < \Y S —Y t \. We conclude that oj x < ui Y , hence the claim holds for the reflected 
diffusion A'. □ 


B Bilinear coercive form 

Recall that H 1 ,H 2 denote the L 2 — Sobolev spaces on [0,1] of order 1 and 2 respectively. For 
differentiable, strictly positive functions cr and p, consider an elliptic operator T on L 2 ([0,1]), with 
Neumann type domain dom(T) = {u £ H 2 : ?/(0) = r/(l) = 0}, given in the divergence form by 

= J^xMxWWY for „ e dom(r) (72 ) 

2/i(x) 

Note that the operator — T is an infinitesimal generator of the diffusion process on [0,1] with 
instantaneous reflection at the boundaries, volatility function cr and an invariant measure with 
density /r. We want to analyze the eigenvalue problem for T, i.e. 

Eigenproblem 50. Find (A ,w) fix dom(T), with w ^ 0, such that 


Tw = Aw. 


Integrating by parts, one can check, that the eigenpairs of the Eigenproblem [50] solve 
Eigenproblem 51. Find (A,u>) £ K x H 1 , with w ^ 0, such that 


/ w'(x)v'(x)a 2 (x)/j,(x)dx = 2A / w(x)v(x)fi(x)dx for all v 
Jo Jo 


£ H L 


(73) 
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Eigenproblem [51] is a weak formulation of the Eigenproblem [50] for the associated Dirichlet 
form l(u,v) = ( Tu,v) fJj . The biggest advantage of the weak formulation is that the Eigenproblem 
EH makes sense for any, not necessarily regular, functions /i. When /x is not differentiable, the 
Eigenproblem [SO] has no longer probabilistic interpretation in terms of the infinitesimal generator. 
Nevertheless, such problems arise naturally when one considers spectral estimation method with 
fixed time horizon, when the role of the invariant measure is taken by the non differentiable 
occupation density. 

In what follows, we want to generalize the results of 0 on the spectral properties of an 
infinitesimal generator, to the solutions of the Eigenproblem EH with a Holder regular function /x. 
For 0 < a < 1 denote by C a the space of a—Holder regular functions on [0,1]. Furthermore, for 
k £ N let C k ’ a be the space of /c—times differentiable functions with k th derivative in C a . 

Definition 52. For any given 0 < d < D let 

:= {(*,/*) € H 1 ([0,1]) x C7“([0,l]) : |M| C « < A 

inf (cr(x) A /x(x)) 
xe[o,i] 

Eigenproblem EH is a conforming eigenvalue problem for a bilinear coercive form on the Hilbert 
space L 2 (/x). Q is a standard reference. 

Proposition 53. Let (cr,/x) £ O a . The Eiaenvroblem [571 has countably many solutions (, 
with real nonnegative eigenvalues 0 = Ao < Ai < A 2 < ... and /x— orthogonal eigenfunctions, 
satisfying Neumann boundary conditions w[(0) = iy((l) = 0. The smallest positive eigenvalue Ai 
is simple, the derivative w[ of the corresponding eigenfunction is 1/2 A a Holder continuous and 
strictly monotone. 

Proof. It is easy to check that for any (a, /x) Ao = 0 and wq = 1 form an eigenpair. Let Lq(/x) = 
{v £ L 2 (/x ) : f v(x)/i{x)dx = 0} and I7q(/x) = Lq(/x) bl H 1 . Lq(/x) with the L 2 (fi) inner product 
and with (u,v)h i(/x) = (u,v) L 2 ^ + u'(x)v'(x)p(x)dx are Hilbert spaces. The identity 

embedding I : Hg(/x) —>■ Lq(iT) is compact. 

For u,v £ let 

l(u,v) = / u'(x)v'(x)a 2 (x)n(x)dx. 

Jo 

l is a symmetric positive-definite bilinear form on Hq (/x) x Hq(p). Furthermore, for any u £ Hq (/x) 
holds 

c ll u llff^(/i) — K u > u ) ^ C|l u ll(74) 

for some constants 0 < c < C that depend only on d , D. Indeed, since er and /x are uniformly 
bounded, we only have to show that f^u 2 {x)dx< f Q (u'(x)) 2 dx. Consider u £ C 1 ([0,1]) fl i?g(/x). 
Since u is continuous and integrates to zero, there exists Xq £ [0,1] s.t. u{x 0 ) = 0. Since 
u(x) = f* u'(y)dy , the upper bound ||xx||l 2 < ||i* / ||i 2 follows from the Cauchy-Schwarz inequality. 
As continuous functions are dense in H 1 , we conclude that (I7H) holds. 

I is the Dirichlet form of an unbounded operator T on L/(/x). Define D = dom(T) as these u £ 
Hq(/T), that the functional v 1 —> l(u , v) is continuous on Hq (/x) with norm || ■ || - By th e definition 

of the weak differentiability, domain D = {u : Hq(p) : u'a 2 p, £ H 1 }. Furthermore, D is dense 
in Lq(/x) (see Exercise 4.51]). For u € D, we define Tu via the Riesz representation theorem 
by l(u,v) = (Tu,v) Such defined T is an elliptic, densely defined, self-adjoint operator with 
compact resolvent (see [3!, Proposition 4.17]). Consequently, T has a discrete spectrum (Ai)j = i ) ..., 
with all eigenvalues positive and corresponding eigenfunctions /x—orthogonal. 

Integrating by parts the right hand side of (1731) . we obtain 


> 


d, J /x(a :)dx = 1 j 


./o 


x)a 2 (x) pi{x)v' (x)dx = —2A, 


fr 


w i{y)t L {y)dyv'{x)dx for all v £ H l . 


37 


Since {V : v £ H 1 } is dense in L 2 , it follows that 


w[{x) 


2A iff Wj(y) fi(y)dy 
a 2 {x)y(x) 


(75) 


By Sobolev embedding a 2 is 1/2—Holder regular. Consequently w[ lies in C 1 / 2A “. Since the 
eigenfunctions y —integrate to zero, we deduce that w'(0) = w'(l) = 0. 

Finally, we need to show that Ai is simple and that w\ is strictly monotone. By the variational 
formula for the eigenpairs of a self-adjoint operator 


ff {u' (x)) 2 a 2 (x) y{x)dx 
2Ai = inf V ,hK ’ 

ueH^(ii) u 2 (x)y(x)dx 


(76) 


Arguing as in (ill . Lemma 6.1], we obtain that [j u 2 {x)y{x)dx = Jo Jo m{y , z)v!(y)u'(z)dydz with 
m(y, z) = ff^~ y(x)dx ff Vz y{x)dx. We deduce that the derivative of the eigenfunction w\ must 
have a constant sign, otherwise we could reduce the ratio in (1761) by considering 


wi = wil^i > 0 ) — wil(w[ < 0 ). 


Hence, the set {a; : w[(x) = 0} has zero Lebesgue measure. From (1751) follows that w[(x) = 0 
only for x = 0,1, meaning that w\ is strictly monotone on (0,1). Consequently, for any two 
eigenfunctions w\ and w \, which correspond to Ai, the scalar product 


/ w 1 {x)w 1 {x)y(x)dx = / / m(y, z)w[{y)w[(z)dydz ^ 0, 

Jo Jo Jo 

hence the eigenspace corresponding to Ai is one dimensional. □ 

Proposition 54. The eigenvalues Ai, A 2 and the norm ratio ||ud||c?i,i/ 2 A<*/||iui||.L 2 Qq ar e uniformly 
bounded for all (a, y) £ O a . Furthermore, for every 0 < a < b < 1, inf^gu;,] |u>((x)| and the 
spectral gap A 2 — Ai have uniform lower bounds on 0 a . 

Proof. We adapt the notation from the proof of Proposition [531 Choose w\ normalized s.t. 
Il^i 11 l 2 (/z) = 1- We will first argue that Ai,A 2 and ||wi|| C ;i,i/ 2 ac« are uniformly bounded on 0 Q . 
From d we imply that 

Ai = l{w i,wi) > c||wi||^i ( m) > c, 

with c > 0 depending only on the bounds on a and y. It follows that the eigenvalues are uniformly 
separated from zero. By the variational formula 


2A2 — 


. fo (u'(x)) 2 a 2 (x)y(x)dx 

mi sup -- j -< 

Sch 1 ^es fo U 2 (x)y(x)dx 

dim(S')—3 


inf 

Sch 1 

dim(S)=3 


° 3 fo( u '( x )) 2dx 
sup -- 

ues d f n u 2 (x)dx 


< 4-7T 


d ’ 


since 47 t 2 is the third eigenvalue of the negative Laplace operator on L 2 ([0,1]) with Neumann 
boundary conditions. We conclude that the eigenvalues Ai and A 2 are uniformly bounded. The 
uniform bound on ||rui|| c>i,i/ 2 aq, follows from the representation (1751) . 

We will now prove a uniform lower bound on the spectral gap A 2 — Ai. Assume by contradiction 
that for some sequence of coefficients {a n ,y n ) £ 0 a the corresponding spectral gaps (A n! 2 — A n! i) 
converge to zero. Since 0 a is compact in the uniform convergence metric, we can assume that 
(crnhn) converges uniformly to some (<r, y) £ 0 a . We will argue that the uniform convergence of 
the coefficients leads to convergence of the eigenvalues, hence contradicts Proposition [53] (cf. fill 
proof of Proposition 6.5]). However, since the function y is embedded in the definition of spaces 
Lo(y) and i7g(/i), we need first to reduce the Eieenproblem I5T1 to a universal function space. 


38 






Let U(x) = f Q l y(y)dy be the distribution function of y. Substituting U(x) = y, we find that 
the Eigenproblem ED is equivalent to 

f w'(x)v' (x)d 2 dx = 2A / w'(x)v'(x)dx for all v £ H 1 
Jo Jo 

W = wolf- 1 , 

with a = (cry) o U~ l . Consider ( d n ) n and cf corresponding to (cr n ,y n ) and (a, y) respectively. 
Note that d n converges to d in the uniform norm. Denote Lq = Lq( 1) and Ft) := H)(l). For 
u, v £ H) denote 

l n (u, v) = / v!(x)v'(x)d n (x) 2 dx 

Jo 

and by T n the corresponding operators on L). Recall that the operators T n are unbounded and 
self-adjoint on L), with dense domains D n . Domains D n do not have to possess a common core, 
which is needed to study the convergence of the sequence (T n ) n . We circumvent this difficulty by 
introducing inverse operators R n = Tff 1 . Using the divergence formula (17511 for T ni we check that 
for u £ Lq 

rv 

(77) 


f x ~ f y 

R n u(x) = -2 / d~ 2 (y) / u(z)dz + c n (u), 

Jo Jo 


where c n (u) £ R is such that / 0 ' R n u(x)dx = 0. The convergence a n —> a in C ,1 [(0,1)] implies that 
operators R n converge to R in the operator norm on L\. By [3, Proposition 5.28] this entails the 
regular convergence, which, by [g, Theorem 5.20], is equivalent to the strongly stable convergence. 
Finally, [s|, Proposition 5.6] ensures the convergence of the eigenvalues with preservation of their 
multiplicities. 

Set 0 < a < b < 1. We finally have to prove the uniform lower bound on inf X £[ a M |u^(a:)|. 
We will use the same indirect arguments as when bounding the spectral gap. Assume that for 
some sequence ( cr n ,y n ) £ 6> a , with (cr n ,y n ) converging in the uniform norm to (cr, y) £ 6> a , the 
corresponding eigenfunctions w\^ n satisfy inf„ inf xe r a w \w( n (x)\ = 0. Arguing as for the spectral 
gap, we reduce the problem to bounded operators ( R n ) n and R. From formula (1771) we deduce 
that the uniform convergence of coefficients implies R n —> R in the operator norm on C([0,1]). 
We conclude, that the eigenfunctions converge in the uniform norm, which contradicts Proposition 

m □ 

Eigenproblem 55. Let Vj be a finite dimensional subspace of L 2 . Find ( \j,wj) eRx Vj . with 
wj / 0 such that 

/ w'(x)v'(x)a 2 (x)y(x)dx = A / w(x)v(x)y(x)dx for any v £ Vj. 

Jo Jo 

Proposition 56. Let (Vj)j=i,... be a sequence of approximation spaces satisfying the following 
Jackson’s type inequality: 

||(/ - ttj)v\\ h i < for v £ C 1 '*, 


where i tj is the L 2 — orthogonal projection on Vj and C > 0 some universal constant. Furthermore, 
assume that every Vj contains constant functions. 

For (a, y) £ O a the Eiaenvroblem 1777 has dim(Vj) solutions ( Xj,i,wj t i)i with real eigenvalues 
0 = Ayo < Ayi < Aj : 2 < ... < Aydim(Vj)-i■ F° r J big enough, the eigenvalue Aj.i and the spectral 
gap Aj .2 — Ayi are uniformly bounded on 0 a . 

Proof. We adapt the notation from the proof of Proposition [53] By the Lax-Milgram theorem, 
there exists an isomorphism Si : H^y) —> Ft^iy) such that 

l(S t v,u ) = (v,u) H i^), for all v,u £ H)(y). 


39 


Note that since for any v £ Loip) the functional .Hq(/z) 9 u i —> (v,u)l 2 u\ £ R is continuous on 
by the Riesz representation theorem there exists a continuous operator J : Lq(/z) —► ^o(m) 
such that 

{v,u) L 2 (fJj) = (Jv,u) H i (fJj) . 

Define the operator Bi = Si o J o I, where I is the identity embedding of Hq(h) into Lq(/z). By 
d, the form / defines an equivalent norm on Note that Bi is a self-adjoint and compact 

operator on the Hilbert space with /—induced inner product. Consider (A i,Wi), a solution 

of the Eigenproblem [51] For any v £ H g (/z) we have 

l(wi,v ) = Xi(wi,v ) L 2 (/i) = A i{Jwi,v) H i(fj,) = A il(SiJwi,v) = l(XiBiWi,v), 

hence (A f ,wf) is an eigenpair of the operator Bi. In particular, Proposition 1531 implies that the 
biggest eigenvalue A^ 1 is simple. 

Denote by ttj the l —orthogonal projection on the subspace Vj. Define the operator B\,j — 
ir l jBiir l j. Since Bj j is a self-adjoint operator on Vj, with the /—induced inner product, it has 
dim(Vj) — 1 solutions (Awith the eigenvalues Aj-, > X~^\ > ... > Anal¬ 

ogously as for the operator Bi, we check that (A jy,wjy) are solutions of the finite dimensional 
Eigenproblem [55] From (l7Hl together with the uniform bound on fi follows that 

\\(I - TT l n ) Wl \\l < \\{I - -K l n ){I - -K^WxWl < 2||(J — 7Tj)wi||/ < C \\(/ - 7T j)W! || , 

for some, uniform on O a , constant C. Using Jackson’s inequality, the uniform bound on the Holder 
norm of uq and uniform bounds on the eigenvalues Ai, A 2 , we conclude that, for J large enough, 

||(/-7r^)u;i|| 1 < Al 6A _^ 2 ■ 

The claim follows from [H|, Theorem 25]. □ 
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